We are looking for a highly skilled and hands-on Generative AI Engineer with strong expertise in Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), vector databases, and scalable AI system development. The ideal candidate should have experience building production-grade AI applications using modern GenAI frameworks and cloud technologies.
Role Overview
The candidate will design, develop, and deploy advanced AI solutions, including conversational AI systems, intelligent search platforms, AI assistants, and enterprise-grade RAG pipelines. The role requires a deep understanding of LLM orchestration, prompt engineering, semantic retrieval, backend API integration, and scalable AI architectures.
Key Responsibilities
- Design and develop end-to-end Generative AI applications using LLMs such as GPT, LLaMA, and other transformer-based models.
- Build scalable Retrieval-Augmented Generation (RAG) pipelines integrating vector databases and embedding models.
- Develop AI-powered chatbots, assistants, semantic search systems, and intelligent recommendation engines.
- Implement LangChain-based workflows, AI agents, and function-calling and tool-calling architectures.
- Integrate AI pipelines with backend systems, APIs, databases, and enterprise applications.
- Optimize prompts, retrieval quality, context management, reranking, and inference performance.
- Develop backend AI services using FastAPI and Python-based microservices.
- Work with vector databases such as Milvus, ChromaDB, and FAISS for semantic search applications.
- Deploy and manage AI workloads on cloud platforms and services, including AWS (Amazon SageMaker) and Azure OpenAI.
- Collaborate with product, engineering, and business teams to convert business requirements into AI solutions.
- Perform model evaluation, monitoring, and continuous optimization for production systems.
- Build scalable and secure AI architectures suitable for enterprise environments.
Required Skills & Qualifications
Technical Skills
Programming & Backend
- Strong proficiency in Python and SQL
- Experience with Object-Oriented Programming (OOP)
- Expertise in FastAPI and backend API development
Generative AI & LLMs
- Strong understanding of:
  - Large Language Models (LLMs)
  - GPT, LLaMA, BERT
  - Prompt Engineering
  - RAG Architectures
  - AI Agents
  - Semantic & Hybrid Search
  - Embedding Models
  - Context Management
  - LLM Application Development
AI Frameworks & Databases
- Experience with:
  - LangChain
  - Milvus
  - ChromaDB
  - FAISS
  - Vector Search Systems
Machine Learning & Deep Learning
- Experience in:
  - NLP
  - Predictive Analytics
  - Classification & Regression Models
  - ANN, RNN, LSTM, Transformer Models
  - Model Evaluation & Optimization
Data & Visualization
- Hands-on experience with:
  - Pandas
  - NumPy
  - EDA & Feature Engineering
  - Tableau
  - Matplotlib / Plotly
Cloud & Deployment
- Experience with:
  - AWS
  - Azure OpenAI
  - Amazon SageMaker
  - GPU-based inference environments
  - vLLM or similar inference frameworks
Version Control
- Git / Bitbucket
- Code reviews and collaborative development workflows
Preferred Experience
- 4–6+ years of experience in AI/ML/Data Science
- Hands-on experience in production-grade Generative AI systems
- Experience in legal AI, healthcare AI, or conversational AI domains
- Exposure to scalable LLM serving and inference optimization
- Experience with hybrid search and reranking pipelines
- Knowledge of enterprise AI deployment practices
Education
- Bachelor’s degree in Engineering, Computer Science, Data Science, AI/ML, or a related field
Preferred Candidate Profile
- Strong analytical and problem-solving abilities
- Ability to independently own AI modules end-to-end
- Excellent communication and collaboration skills
- Passion for cutting-edge AI technologies and innovation
- Ability to work in fast-paced product environments
Nice to Have
- Experience with AI Agents and autonomous workflows
- Experience with fine-tuning or serving open-source LLMs
- Knowledge of scalable inference systems and GPU optimization
- Familiarity with enterprise AI governance and security practices
Pay: ₹500,000.00 - ₹1,400,000.00 per year
Work Location: Hybrid remote in Bengaluru, Karnataka