Level: L2 / Mid
Location: Noida, India
Type: Full-time
About Us:
CLOUDSUFI, a Google Cloud Premier Partner, is a global leading provider of data-driven digital transformation across cloud-based enterprises. With a global presence and focus on Software & Platforms, Life sciences and Healthcare, Retail, CPG, financial services and supply chain, CLOUDSUFI is positioned to meet customers where they are in their data monetization journey.
Our Values
We are a passionate and empathetic team that prioritizes human values. Our purpose is to elevate the quality of lives for our family, customers, partners and the community.
Equal Opportunity Statement
CLOUDSUFI is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. All qualified candidates receive consideration for employment without regard to race, colour, religion, gender, gender identity or expression, sexual orientation and national origin status. We provide equal opportunities in employment, advancement, and all other areas of our workplace. Please explore more at https://www.cloudsufi.com/
Responsibilities
- Deploy and instrument AI agents for testing and monitoring.
- Set up observability — traces, logs, and metrics — and continuous evaluation on production traffic.
- Build drift detection and alerting so quality regressions are caught early.
- Own the cloud infrastructure and CI/CD automation supporting the platform.
- Manage environments, access, and operational reliability.
Required qualifications
- 3+ years in DevOps, MLOps, or platform engineering.
- Strong Google Cloud experience (Cloud Run, Cloud Build, Pub/Sub, Cloud Logging and Monitoring, IAM) or equivalent cloud platform.
- Experience with CI/CD automation and infrastructure as code.
- Familiarity with deploying LLM or AI agent workloads.
Preferred qualifications
- Experience with OpenTelemetry and AI/agent observability.
- Exposure to Vertex AI, Gemini Enterprise Agent Platform, or ADK.