Job Title: Senior AI/ML Engineer (LLM, Kafka & Kubernetes)
Location: Chennai
Employment Type: Full-Time
Experience: 4+ Years
About the Role
We are looking for a highly skilled Senior AI/ML Engineer with strong expertise in Generative AI, Large Language Models (LLMs), Apache Kafka, Kubernetes, and MLOps. The ideal candidate will be responsible for designing, developing, and deploying enterprise-grade AI solutions capable of processing real-time streaming data at scale.
Key Responsibilities
AI & Machine Learning
- Design and develop Generative AI applications using LLMs.
- Build Retrieval-Augmented Generation (RAG) systems for enterprise use cases.
- Develop AI agents and multi-agent workflows.
- Fine-tune, evaluate, and optimize AI models.
- Build production-grade inference pipelines.
- Implement prompt engineering and AI evaluation frameworks.
Data Streaming & Event-Driven Systems
- Develop real-time AI applications using Apache Kafka.
- Design and implement event-driven architectures.
- Process streaming data for AI workflows and automation.
- Build monitoring and alerting mechanisms for AI pipelines.
MLOps & Infrastructure
- Deploy and manage AI workloads on Kubernetes.
- Build scalable model serving infrastructure.
- Manage model lifecycle, deployment, and versioning.
- Optimize GPU utilization and inference performance.
- Implement CI/CD pipelines and observability practices.
Cloud & Enterprise AI
- Deploy AI solutions on AWS, Azure, or GCP.
- Design secure and scalable enterprise AI architectures.
- Develop microservices-based AI applications.
- Collaborate with product and engineering teams to deliver AI-driven solutions.
Required Skills
AI/ML
- 4+ years of hands-on experience in AI/ML development.
- Strong proficiency in Python.
- Experience with LLMs including GPT, Claude, Gemini, Llama, Mistral, and Qwen.
- Experience building RAG applications and AI assistants.
- Strong understanding of embeddings, retrieval, reranking, and model evaluation.
- Experience with vector databases such as Pinecone, Weaviate, Qdrant, or Milvus.
AI Frameworks
- LangGraph
- LangChain
- LlamaIndex
- CrewAI (Preferred)
- MCP (Model Context Protocol)
Apache Kafka (Mandatory)
- Production experience with Apache Kafka.
- Strong understanding of Topics, Partitions, Consumer Groups, Event Streaming, and Kafka Connect.
- Experience building real-time AI systems using streaming architectures.
Kubernetes (Mandatory)
- Experience deploying AI workloads on Kubernetes.
- Knowledge of Deployments, StatefulSets, Services, Ingress, and Autoscaling.
- Experience managing containerized AI applications in production.
MLOps
- Docker
- Kubernetes
- MLflow
- CI/CD Pipelines
- Model Monitoring
- Observability Tools
Nice to Have
- Apache Flink
- Ray
- vLLM
- NVIDIA Triton Inference Server
- Confluent Platform
- Databricks
- Apache Spark
- TensorRT
- GPU Optimization
What You'll Build
- Enterprise AI Agents
- AI Copilots
- Real-Time AI Inference Systems
- Streaming AI Applications
- Intelligent Document Processing Systems
- Enterprise Knowledge Assistants
- Multi-Agent AI Platforms
- Agentic Workflow Automation
Preferred Candidate Profile
We are looking for someone who has built and deployed production-grade AI systems and who brings strong expertise in Kafka and Kubernetes, along with hands-on experience in Generative AI and enterprise-scale AI deployments.
Notice Period: Immediate to 30 Days Preferred
Compensation: Best in Industry Based on Skills and Experience.
Work Location: In person