AI / ML Engineer
Experience: 2–8 Years | Type: Full-Time | Location: Pune, India (Remote / Hybrid / On-site)
Job Summary
We are hiring an AI/ML Engineer to design and deploy LLM-powered applications, AI Agents, and agentic automation systems. You will work across the full lifecycle, from prototyping to production, building intelligent solutions with current Generative AI tools and frameworks.
Key Responsibilities
- Build and deploy LLM-based applications using OpenAI, Anthropic Claude, Gemini, and Hugging Face models
- Develop AI Agents and multi-agent systems using LangGraph, CrewAI, AutoGen, and MCP (Model Context Protocol)
- Design and implement RAG pipelines with vector databases (Pinecone, Weaviate, ChromaDB, FAISS, Milvus)
- Perform prompt engineering, fine-tuning (LoRA/QLoRA), and model evaluation
- Build agentic workflows integrating external APIs, tools, and enterprise data via function/tool calling
- Monitor LLM pipelines using LangSmith and Langfuse; track costs, latency, and quality
- Deploy AI services using FastAPI, Docker, and Kubernetes on AWS, Azure, or GCP
- Follow MLOps and LLMOps best practices: versioning, CI/CD, monitoring, and drift detection
Required Skills
- Languages: Python (advanced), SQL
- AI/ML: Machine Learning, Deep Learning, Transformer architectures
- Generative AI: LLMs, Prompt Engineering, RAG, Fine-tuning (LoRA/QLoRA)
- Agents: AI Agents, Multi-Agent Systems, Agentic Workflows, MCP, Function Calling, Tool Calling
- Frameworks: LangChain, LangGraph, LlamaIndex, CrewAI, AutoGen
- LLM Providers: OpenAI, Anthropic Claude, Google Gemini, Hugging Face
- Observability: LangSmith, Langfuse
- Vector DBs: Pinecone, Weaviate, ChromaDB, FAISS, Milvus
- Infra: FastAPI, Docker, Kubernetes
- Cloud: AWS (Bedrock/SageMaker), Azure (OpenAI Service), or GCP (Vertex AI)
- LLMOps: Model evaluation, monitoring, deployment pipelines
Preferred Skills
- Hands-on experience with AI agent frameworks — LangChain, LangGraph, LlamaIndex, CrewAI, AutoGen, or similar
- Experience with MCP (Model Context Protocol) for agent integrations
- Exposure to AI evaluation frameworks and red-teaming practices
Qualifications
- B.E. / B.Tech / M.Tech in CS, AI/ML, Data Science, or equivalent
- 2–8 years in AI/ML engineering; 1–2+ years hands-on with LLMs or Generative AI
Pay: ₹400,000.00 - ₹2,000,000.00 per year
Benefits:
- Flexible schedule
- Internet reimbursement
- Paid sick time
- Paid time off
- Work from home
Work Location: Hybrid remote in Pune, Maharashtra (Pune, Pune District)