Role: AI Platform Engineer
Location: Addison, TX/ Charlotte, NC- Onsite
No GC's or H1
Role Overview
We are looking for an AI Platform Engineer—a builder who can architect the "factory" where AI is made.
Our goal is to build an internal, on-premises AI ecosystem that mimics the capabilities of AWS or Azure. You will be responsible for creating a horizontal platform used by various lines of business to deploy AI projects simultaneously.
Key Responsibilities
- Platform Architecture: Design and develop a "Model-as-a-Service" platform that allows non-experts to use drag-and-drop components to build AI solutions.
- RAG-as-a-Service: Build and optimize end-to-end Retrieval-Augmented Generation (RAG) pipelines, including sophisticated chunking strategies and vector database management.
- Tooling & Libraries: Develop and maintain MCP (Model Control Protocol) libraries, clients, and servers to connect various data sources to the AI engine.
- Infrastructure Management: Help manage and optimize one of the largest on-premise GPU farms in the U.S. banking sector (500+ Nvidia nodes).
- Agentic AI: Build a repository for Agentic AI where users can select existing agents or build custom ones for specialized tasks.
- CI/CD Integration: Integrate AI deployment pipelines with enterprise-level CI/CD tools like Jenkins and Ansible.
- Compliance & Guardrails: Implement corporate-level guardrails and work within Model Risk Management (MRM) frameworks to ensure all AI deployments are secure and compliant.
Required Technical Skills
- Expert Python: Deep, hands-on knowledge is mandatory.
- Data Engineering: Extensive experience in massive data ingestion and processing.
- RAG Expertise: Deep understanding of vector databases, inferencing, and advanced chunking strategies.
- Platform Engineering: Proven experience building tools/platforms that other developers or business units use.
- Infrastructure Knowledge: Experience mimicking cloud capabilities (AWS/Azure) within a strictly on-premise environment.
- DevOps: Familiarity with Jenkins, Ansible, and automated deployment pipelines.