Job Title: Senior AI/ML Engineer – LLM, VLM & Generative AI
Location: Navi Mumbai - Ghansoli(Onsite)
Final Interview: Face to Face
Experience: 5+ Years
Employment Type: Full-Time
Job Summary
We are looking for a Senior AI/ML Engineer to build and deploy next-generation AI solutions powered by Large Language Models (LLMs), Vision Language Models (VLMs), and Generative AI. The ideal candidate should have strong expertise in core AI/ML, deep learning, multimodal AI, and backend engineering, with experience taking AI models from research to production.
Key Responsibilities
- Design, develop, and deploy production-grade AI/ML solutions using Python and modern deep learning frameworks.
- Build RAG (Retrieval-Augmented Generation) pipelines, agentic AI workflows, and enterprise knowledge retrieval systems.
- Develop Vision Language Model (VLM) applications for image understanding, OCR, document intelligence, and multimodal AI.
- Fine-tune and optimize open-source LLMs using LoRA, PEFT, and prompt engineering techniques.
- Design embedding pipelines, vector database integrations, semantic search, and retrieval systems.
- Build scalable backend services and AI APIs using FastAPI/Flask.
- Implement model monitoring, evaluation, hallucination detection, guardrails, and performance optimization.
- Collaborate with product, engineering, and data teams to deliver reliable AI solutions for production environments.
Required Skills
- Strong Python programming skills with FastAPI or Flask.
- 5+ years of experience in AI/ML, Deep Learning, and production ML systems.
- Hands-on experience with PyTorch, TensorFlow, Scikit-learn, and model deployment.
- Strong expertise in LLMs, RAG, Prompt Engineering, Embeddings, Vector Databases, and Semantic Search.
- Experience with LangChain, LlamaIndex, or similar AI orchestration frameworks.
- Hands-on experience with Vision Language Models (VLMs) such as CLIP, BLIP, LLaVA, OCR, and multimodal AI.
- Experience with Docker, Kubernetes, CI/CD, and cloud platforms (AWS, Azure, or GCP).
- Knowledge of Kafka/RabbitMQ, asynchronous processing, and scalable AI architectures.
Preferred Skills
- Experience with LoRA, PEFT, Quantization, vLLM, Triton Inference Server, or distributed inference.
- Knowledge of AI Agents, Multi-Agent Systems, LangGraph, CrewAI, or AutoGen.
- Experience with MLflow, Weights & Biases, Airflow, and MLOps practices.
- Familiarity with GPU optimization, model serving, and production AI infrastructure.
Preferred Background
- Experience building enterprise AI products from research to production.
- Background in Computer Vision, NLP, Document AI, or Multimodal AI.
- Experience working with scalable AI platforms in fintech, healthcare, enterprise SaaS, or regulated industries.
Application Question(s):
- Are you an immediate Joiner?
- Mention your last working day.
- Available for final round Face to face ?
- How many Years of experience in Vision Language Model?
- Total year of experience in AI/ML?
Location:
- Navi Mumbai, Maharashtra (Preferred)
Work Location: In person