About Us:
CLOUDSUFI, a Google Cloud Premier Partner, is a global leading provider of data-driven digital transformation across cloud-based enterprises. With a global presence and focus on Software & Platforms, Life sciences and Healthcare, Retail, CPG, financial services and supply chain, CLOUDSUFI is positioned to meet customers where they are in their data monetization journey.
Our Values
We are a passionate and empathetic team that prioritizes human values. Our purpose is to elevate the quality of lives for our family, customers, partners and the community.
Equal Opportunity Statement
CLOUDSUFI is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. All qualified candidates receive consideration for employment without regard to race, colour, religion, gender, gender identity or expression, sexual orientation and national origin status. We provide equal opportunities in employment, advancement, and all other areas of our workplace. Please explore more at https://www.cloudsufi.com/
Lead. MLOps Engineer
We are looking for Lead MLOps Engineer, who will play a crucial role in building and scaling machine learning operations at AI COE. This position will be 30% leadership and 70% hands-on, ensuring both strategic oversight and direct involvement in MLOps infrastructure design, automation, and optimization. You will lead a team while collaborating with various stakeholders to manage machine learning pipelines and model deployments in Google Cloud Platform (GCP). One of key parts of this role would also managing data and models using data cataloging tools, ensuring that they are well-documented, versioned, and accessible for reuse and auditing.
About the job
Lead(30%) and Engineer(70%) AI developed models to production in GCP and owning the model maintenance, monitoring and support activities.
Split time between high-level strategy and hands-on technical implementation.
Architect, build, and maintain scalable MLOps pipelines, with a focus on Google Cloud Platform (GCP) services such as Vertex AI, GKE, Cloud Storage, and BigQuery.Stay up-to-date with the latest trends and advancements in MLOps.
Implement and optimize CI/CD pipelines for machine learning model deployment, ensuring minimal downtime and streamlined processes.
Work closely with data scientists and data engineers to ensure efficient data processing pipelines, model training, testing, and deployment.
Manage data catalog tools for model and dataset versioning, lineage tracking, and governance. Ensure that all models and datasets are properly documented and Discoverable.
Develop automated systems for model monitoring, logging, and performance tracking in production environments.
Lead the integration of data cataloging tools (e.g., Open Meta Data), ensuring the traceability and versioning of both datasets and models.
About you
We are looking for a unique and amazing talent, who brings along the following:
University degree/ Education, preferably in Mathematics, Statistics, Computer Science, Physics or similar.
Minimum 6 years of professional experience in a MLOps roles within an international setting with a minimum of 2 years of Project Lead Experience.
Excellent analytical and problem-solving skills for technical challenges related to MLOps.
Excellent English proficiency, presentation, and communication skills
Proven experience in deploying, monitoring, and managing machine learning models on Google Cloud Platform (GCP).
Hands-on experience with data catalog tools.
Expert in Google Cloud in GCP services such as Vertex AI, GKE, BigQuery, and Cloud Build, Endpoint etc for building scalable ML infrastructure (GCP official Certifications are a huge plus)
Experience with model serving frameworks (e.g., TensorFlow Serving, TorchServe), and MLOps tools like Kubeflow, MLflow, or TFX.