Senior AI/ML Engineer – LLM, VLM & Generative AI

Programming.com -
Navi Mumbai, Maharashtra

Quick apply

Job details

Permanent | Full-time
1 day ago

Qualifications

TensorFlow
CI/CD
Image processing
Azure
Law
Kubernetes
PyTorch
Computer vision
Research
Master's degree
AWS
Docker
Machine learning
Continuous integration
Deep learning
RabbitMQ
Natural language processing
APIs
Kafka
Flask
AI
Python

Full job description

Job Title: Senior AI/ML Engineer – LLM, VLM & Generative AI
Location: Navi Mumbai - Ghansoli(Onsite)
Final Interview: Face to Face
Experience: 5+ Years
Employment Type: Full-Time

Job Summary

We are looking for a Senior AI/ML Engineer to build and deploy next-generation AI solutions powered by Large Language Models (LLMs), Vision Language Models (VLMs), and Generative AI. The ideal candidate should have strong expertise in core AI/ML, deep learning, multimodal AI, and backend engineering, with experience taking AI models from research to production.

Key Responsibilities

Design, develop, and deploy production-grade AI/ML solutions using Python and modern deep learning frameworks.
Build RAG (Retrieval-Augmented Generation) pipelines, agentic AI workflows, and enterprise knowledge retrieval systems.
Develop Vision Language Model (VLM) applications for image understanding, OCR, document intelligence, and multimodal AI.
Fine-tune and optimize open-source LLMs using LoRA, PEFT, and prompt engineering techniques.
Design embedding pipelines, vector database integrations, semantic search, and retrieval systems.
Build scalable backend services and AI APIs using FastAPI/Flask.
Implement model monitoring, evaluation, hallucination detection, guardrails, and performance optimization.
Collaborate with product, engineering, and data teams to deliver reliable AI solutions for production environments.

Required Skills

Strong Python programming skills with FastAPI or Flask.
5+ years of experience in AI/ML, Deep Learning, and production ML systems.
Hands-on experience with PyTorch, TensorFlow, Scikit-learn, and model deployment.
Strong expertise in LLMs, RAG, Prompt Engineering, Embeddings, Vector Databases, and Semantic Search.
Experience with LangChain, LlamaIndex, or similar AI orchestration frameworks.
Hands-on experience with Vision Language Models (VLMs) such as CLIP, BLIP, LLaVA, OCR, and multimodal AI.
Experience with Docker, Kubernetes, CI/CD, and cloud platforms (AWS, Azure, or GCP).
Knowledge of Kafka/RabbitMQ, asynchronous processing, and scalable AI architectures.

Preferred Skills

Experience with LoRA, PEFT, Quantization, vLLM, Triton Inference Server, or distributed inference.
Knowledge of AI Agents, Multi-Agent Systems, LangGraph, CrewAI, or AutoGen.
Experience with MLflow, Weights & Biases, Airflow, and MLOps practices.
Familiarity with GPU optimization, model serving, and production AI infrastructure.

Preferred Background

Experience building enterprise AI products from research to production.
Background in Computer Vision, NLP, Document AI, or Multimodal AI.
Experience working with scalable AI platforms in fintech, healthcare, enterprise SaaS, or regulated industries.

Application Question(s):

Are you an immediate Joiner?
Mention your last working day.
Available for final round Face to face ?
How many Years of experience in Vision Language Model?
Total year of experience in AI/ML?

Location:

Navi Mumbai, Maharashtra (Preferred)

Work Location: In person

Quick apply

Jobseeker tools

Employer Tools

Browse

Stay Connected