Project Role : AI / ML Engineer
Project Role Description : Develops applications and systems that use AI tools and cloud AI services, with a production-ready application pipeline in the cloud or on-prem. Able to apply GenAI models as part of the solution. Work may also include, but is not limited to, deep learning, neural networks, chatbots, and image processing.
Must have skills : Machine Learning Operations, DevOps, Large Language Models (LLMs)
Good to have skills : NA
Minimum 7.5 years of experience is required
Educational Qualification : 15 years full time education
Summary:
Pod: Pod 3 — Platform DevOps
Reports to: Engineering Manager, Platform Core (EM1)
Experience: 8–12 years
Location: Bangalore / US (flexible)
ROLE OVERVIEW: This is the tech lead role for Pod 3, the infrastructure layer that keeps the entire platform operational, releasable, and observable. You will own the agent runtime hosting environment, the CI/CD pipeline for platform releases, and the observability infrastructure.
Roles & Responsibilities:
Design and own the agent runtime hosting architecture — compute, scaling, isolation between agents, and resource governance
Build and operate the CI/CD pipeline for platform releases
Lead Pod 3's two DevOps/infrastructure engineers and release engineer
Own the platform observability stack
Define environment strategy: sandbox, staging, and production environments
Establish the platform's reliability contract
Partner with the Security and Governance pod on infrastructure security
IDEAL PROFILE:
Has built and operated the infrastructure layer for a machine learning or AI platform in production
Holds strong opinions about CI/CD for platform engineering
Understands the cost management dimension of LLM infrastructure
Has led a small infrastructure engineering team