Senior AI/ML Engineer (LLM, Kafka & Kubernetes)

Rytsense Technologies
Chennai, Tamil Nadu

Quick apply

Job details

Permanent | Full-time
2 days ago

Qualifications

CI/CD
Azure
Law
MCP
Kubernetes
Spark
Master's degree
Databases
AWS
Docker
Machine learning
Continuous integration
Apache
Kafka
AI
Python

Full job description

Job Title: Senior AI/ML Engineer (LLM, Kafka & Kubernetes)

Location: Chennai
Employment Type: Full-Time
Experience: 4+ Years

About the Role

We are looking for a highly skilled Senior AI/ML Engineer with strong expertise in Generative AI, Large Language Models (LLMs), Apache Kafka, Kubernetes, and MLOps. The ideal candidate will be responsible for designing, developing, and deploying enterprise-grade AI solutions capable of processing real-time streaming data at scale.

Key ResponsibilitiesAI & Machine Learning

Design and develop Generative AI applications using LLMs.
Build Retrieval-Augmented Generation (RAG) systems for enterprise use cases.
Develop AI agents and multi-agent workflows.
Fine-tune, evaluate, and optimize AI models.
Build production-grade inference pipelines.
Implement prompt engineering and AI evaluation frameworks.

Data Streaming & Event-Driven Systems

Develop real-time AI applications using Apache Kafka.
Design and implement event-driven architectures.
Process streaming data for AI workflows and automation.
Build monitoring and alerting mechanisms for AI pipelines.

MLOps & Infrastructure

Deploy and manage AI workloads on Kubernetes.
Build scalable model serving infrastructure.
Manage model lifecycle, deployment, and versioning.
Optimize GPU utilization and inference performance.
Implement CI/CD pipelines and observability practices.

Cloud & Enterprise AI

Deploy AI solutions on AWS, Azure, or GCP.
Design secure and scalable enterprise AI architectures.
Develop microservices-based AI applications.
Collaborate with product and engineering teams to deliver AI-driven solutions.

Required SkillsAI/ML

4+ years of hands-on experience in AI/ML development.
Strong proficiency in Python.
Experience with LLMs including GPT, Claude, Gemini, Llama, Mistral, and Qwen.
Experience building RAG applications and AI assistants.
Strong understanding of embeddings, retrieval, reranking, and model evaluation.
Experience with Vector Databases such as Pinecone, Weaviate, Qdrant, or Milvus.

AI Frameworks

LangGraph
LangChain
LlamaIndex
CrewAI (Preferred)
MCP (Model Context Protocol)

Apache Kafka (Mandatory)

Production experience with Apache Kafka.
Strong understanding of Topics, Partitions, Consumer Groups, Event Streaming, and Kafka Connect.
Experience building real-time AI systems using streaming architectures.

Kubernetes (Mandatory)

Experience deploying AI workloads on Kubernetes.
Knowledge of Deployments, StatefulSets, Services, Ingress, and Autoscaling.
Experience managing containerized AI applications in production.

MLOps

Docker
Kubernetes
MLflow
CI/CD Pipelines
Model Monitoring
Observability Tools

Nice to Have

Apache Flink
Ray
vLLM
NVIDIA Triton Inference Server
Confluent Platform
Databricks
Apache Spark
TensorRT
GPU Optimization

What You'll Build

Enterprise AI Agents
AI Copilots
Real-Time AI Inference Systems
Streaming AI Applications
Intelligent Document Processing Systems
Enterprise Knowledge Assistants
Multi-Agent AI Platforms
Agentic Workflow Automation

Preferred Candidate Profile

We are looking for someone who has successfully built and deployed production-grade AI systems with strong expertise in Kafka and Kubernetes, along with hands-on experience in Generative AI and enterprise-scale AI deployments.

Notice Period: Immediate to 30 Days Preferred
Compensation: Best in Industry Based on Skills and Experience.

Work Location: In person

Quick apply

Jobseeker tools

Employer Tools

Browse

Stay Connected