Role description
UST is seeking an experienced MLOps Lead to drive the design, implementation, automation, and operationalization of machine learning platforms and workflows. The ideal candidate will have strong expertise in Python development, Linux system administration, automation through Bash scripting, and experience in mentoring and guiding junior engineers and developers.
The role requires close collaboration with Data Scientists, Engineers, Platform Engineers, AI Architects, and Business Stakeholders to build scalable, reliable, and production-grade ML solutions.
Key Responsibilities
- Design, implement, and manage scalable MLOps platforms and CI/CD pipelines for ML workflows.
- Develop and maintain robust Python-based automation and orchestration solutions.
- Administer and troubleshoot Linux-based environments and infrastructure.
- Automate deployment, monitoring, retraining, and maintenance of machine learning models.
- Create and maintain Bash/Shell scripts for automation, system operations, deployments, and monitoring activities.
- Ensure platform reliability, security, scalability, performance, proper access control, and operational excellence.
- Collaborate with Data Science and Engineering teams to streamline ML operations and deployment processes.
- Mentor junior developers and engineers by providing technical guidance, design and code reviews, and promoting best practices.
- Implement monitoring, logging, ing, and performance optimization for ML systems.
- Drive technical standards, documentation, and operational governance across MLOps initiatives.
- Support troubleshooting and root cause analysis for production incidents and platform issues.
- Enforce promotion governance, version control practices, and establishment of golden datasets.
Required Skills & Qualifications
- 10+ years of overall IT experience with strong exposure to MLOps and Platform Engineering.
- Strong proficiency in Python programming.
- Hands-on experience with Linux System Administration.
- Strong scripting expertise using Bash/Shell scripting.
- Experience building automation frameworks and operational tooling.
- Experience working with Dataiku or similar platforms is preferred.
- Strong understanding of CI/CD pipelines and infrastructure provisioning.
- Experience working with observability and monitoring tools for ML platforms and infrastructure.
- Experience working with cloud and containerized environments such as EKS is preferred.
- Experience working with LLMs, Generative AI solutions, and Agentic AI frameworks is an added advantage.
- Strong problem-solving, troubleshooting, and analytical skills.
- Experience in implementing cost and resource optimization techniques for MLOps platforms and ML workloads.
- Experience working in Agile delivery environments.
Soft Skills
- Strong leadership skills with experience in mentoring junior developers and engineers.
- Excellent collaboration and communication skills.
- Ability to drive technical discussions and architectural decisions.
- Ownership mindset with focus on quality, timely delivery, scalability, and operational stability.
Skills
mlops,machine learning,python,bash scripting,linux,dataiku,kubernetes,sql,agentic ai,cicd,data science
About UST
UST is a global digital transformation solutions provider. For more than 20 years, UST has worked side by side with the world’s best companies to make a real impact through transformation. Powered by technology, inspired by people and led by purpose, UST partners with their clients from design to operation. With deep domain expertise and a future-proof philosophy, UST embeds innovation and agility into their clients’ organizations. With over 30,000 employees in 30 countries, UST builds for boundless impact—touching billions of lives in the process.