Job Title: MLOps Engineer (GenAI Infrastructure)
Location: India (Remote)
About the Role
We are hiring highly skilled MLOps Engineers to work on cutting-edge Generative AI infrastructure. This is not a traditional MLOps role—this position focuses on deep AI systems engineering, large-scale training infrastructure, and performance optimization.
You will be part of an advanced AI environment, contributing to the development and optimization of LLM training systems and next-generation machine learning platforms
Key Responsibilities
- Design, build, and optimize ML training infrastructure for large-scale models (LLMs)
- Improve performance of distributed training pipelines
- Develop and evaluate advanced MLOps workflows and systems
- Work on low-level optimization using Triton or Pallas (GPU kernel-level)
- Collaborate with research and engineering teams to enhance AI system efficiency
- Analyze and troubleshoot performance bottlenecks in ML systems
Required Skills & Qualifications
- 2+ years of experience in MLOps / ML Systems Engineering
- Strong hands-on experience with JAX and/or PyTorch
- Experience with Triton or Pallas for GPU kernel development
- Solid understanding of distributed systems and ML training infrastructure
- Experience working with large-scale machine learning pipelines
- Strong problem-solving and analytical skills
- Ability to clearly communicate technical concepts and decisions
Preferred Qualifications
- Experience working with Large Language Models (LLMs)
- Knowledge of GPU optimization and performance tuning
- Exposure to deep learning frameworks and compiler-level optimizations
- Prior experience in high-performance computing environments
Pay: ₹50,000.00 - ₹70,000.00 per month
Work Location: Remote