Location: Boston, Massachusetts (MA)
Contract Type: W2
Posted: 2 months ago
Closed Date: 02/28/2025
Skills: AWS Lambda, Kubernetes, Docker
Visa Type: GC EAD, GreenCard, H1B, H4 EAD, USC

Role: Gen AI Engineer

Location: Boston, MA

Experience: 12+ years (GC and USC only)

Industry: AI/ML, Enterprise Applications, Healthcare (if applicable) 

Role Summary: 

Seeking an experienced GenAI Engineer to integrate LLM APIs, build AI-driven applications, optimize model performance, and deploy AI services at scale. The ideal candidate has expertise in Python-based AI development, LLM orchestration, cloud deployment, and enterprise AI integration.

Key Responsibilities: 

·        AI Application Development – Build and maintain Python-based AI services using LangChain and CrewAI. Implement RAG-based retrieval and agentic AI workflows.

·        LLM Integration & Optimization – Integrate OpenAI, Bard, Claude, and Azure OpenAI APIs. Tune API calls with the temperature, top-p, and max-tokens parameters, and reduce hallucinations using embedding-based retrieval (FAISS, Pinecone).

·        Model Evaluation & Performance Tuning – Assess AI models using Model Scoring, fine-tune embeddings, and enhance similarity search for retrieval-augmented applications. 

·        API & Microservices Development – Design scalable RESTful API services. Secure AI endpoints using OAuth2, JWT authentication, and API rate limiting.

·        Cloud Deployment & Orchestration – Deploy AI-powered applications using AWS Lambda, Kubernetes, Docker, and CI/CD pipelines. Implement LangChain for AI workflow automation.

·        Agile Development & Innovation – Work in Scrum teams, estimate tasks accurately, and contribute to incremental AI feature releases

 

Tech Stack & Tools: 

 AI/ML: PyTorch, TensorFlow, Hugging Face, Pinecone 

 LLMs & APIs: OpenAI, LangChain, CrewAI 

 Cloud & DevOps: AWS, Azure, Kubernetes, Docker, CI/CD 

 Security & Compliance: OAuth2, JWT, HIPAA