THIS IS A NIGHTSHIFT ROLE THAT REQUIRES YOU TO WORK IN THE U.S HOURS
Senior Cloud Infra Ops Engineer (AWS & Azure)
We are expanding our multi-cloud strategy across AWS and Azure and strengthening our Cloud Infrastructure Operations (CloudOps) organization to support enterprise-scale operations, governance, reliability, and modernization initiatives.
The Opportunity
We are seeking a Senior Cloud Infrastructure Operations Engineer to join our CloudOps organization and help operate, automate, secure, and optimize enterprise cloud environments across AWS and Azure.
This role is ideal for an engineer with strong operational and infrastructure experience who thrives in large-scale production environments and enjoys solving complex cloud operational challenges.
You will play a key role in:
- Operating and improving enterprise cloud platforms
- Driving automation and operational efficiency
- Enhancing cloud reliability, observability, and governance
- Supporting production infrastructure and incident response
- Partnering with Platform Engineering, Security, Networking, and Application teams
- Advancing enterprise cloud operational maturity and FinOps initiatives
If you enjoy cloud operations, automation, reliability engineering, and building scalable operational processes, this role is for you.
What You’ll Do
Cloud Infrastructure Operations
- Operate and support enterprise AWS and Azure cloud environments
- Manage highly available production infrastructure and critical business systems
- Maintain operational health, uptime, resiliency, and performance across cloud platforms
- Support cloud platform lifecycle activities including provisioning, upgrades, patching, and remediation
- Assist with disaster recovery readiness and operational resiliency initiatives
Monitoring, Observability & Incident Management
- Monitor cloud environments using tools such as:
- AWS CloudWatch
- Azure Monitor
- Datadog
- PagerDuty
- Participate in on-call rotations and production incident response
- Troubleshoot infrastructure, networking, and platform issues
- Lead root cause analysis (RCA) and implement long-term corrective actions
- Improve operational visibility through dashboards, alerting, and automation
- Help mature operational processes, escalation procedures, and incident response standards
Automation & Infrastructure as Code
- Develop and maintain Infrastructure as Code using Terraform
- Automate operational tasks, provisioning workflows, and remediation processes
- Support CI/CD and GitOps operational practices
- Build reusable operational tooling and scripts to improve efficiency and reduce manual work
- Collaborate with Platform Engineering teams on cloud automation standards
Cloud Governance & Security Operations
- Support enterprise cloud governance standards across AWS and Azure
- Assist with cloud security remediation and compliance initiatives
- Support governance frameworks including:
- AWS Organizations and SCPs
- Azure Policy
- Partner with Security Engineering on vulnerability remediation, monitoring, and cloud security posture improvements
Networking & Platform Support
- Support cloud networking components including:
- VPCs / VNets
- Routing
- VPNs
- Connectivity and DNS
- Assist with operational support for Kubernetes platforms:
- Amazon EKS
- Azure AKS
- Support containerized workloads and platform services
FinOps & Cost Optimization
- Identify opportunities for cloud cost optimization and operational efficiency
- Assist with implementation and management of:
- AWS Savings Plans
- Reserved Instances (RIs)
- Azure Reservations
- Azure Savings Plans
- Support cloud usage analysis, reporting, forecasting, and governance initiatives
- Collaborate with leadership and finance teams on cloud optimization strategies
Collaboration & Operational Excellence
- Partner with Cloud Platform Engineering, Security, Networking, and Application teams
- Participate in operational reviews, architecture discussions, and platform improvement initiatives
- Develop operational documentation, standards, playbooks, and runbooks
- Mentor junior engineers and contribute to operational maturity improvements
- Promote operational best practices and automation-first culture
What You Bring
Required Qualifications
- 7+ years of experience in cloud infrastructure operations or cloud engineering
- Strong hands-on experience with AWS
- Experience supporting Azure cloud environments
- Strong experience with Terraform and Infrastructure as Code
- Experience supporting large-scale production cloud environments
- Strong troubleshooting and operational support experience
- Experience with monitoring and observability platforms
- Strong understanding of:
- Cloud networking
- IAM and identity management
- High availability and disaster recovery concepts
- Experience with Linux and cloud operational tooling
- Experience supporting Kubernetes platforms (EKS and/or AKS)
- Ability to work effectively during production incidents and operational escalations
Preferred Qualifications
- Experience with Datadog, PagerDuty, or similar operational platforms
- Experience with GitOps and CI/CD operational support
- Experience working within enterprise CloudOps or SRE environments
- Familiarity with cloud governance and compliance frameworks
- Exposure to FinOps and cloud cost optimization initiatives
- Experience working in multi-cloud enterprise environments
- AWS and/or Azure certifications preferred
Why Join Us
- Operate enterprise-scale AWS and Azure cloud platforms
- Work on high-impact cloud modernization and operational initiatives
- Help shape the future of enterprise CloudOps and cloud governance
- Collaborate with experienced engineering and leadership teams
- Drive automation, observability, reliability, and operational excellence
- Competitive compensation, benefits, and career growth opportunities
Work Environment
This is a remote role with occasional collaboration across distributed teams and business units. You’ll work in a fast-paced, highly collaborative environment focused on operational excellence, cloud modernization, automation, and enterprise reliability.
Pay: From ₹2,000,000.00 per year
Application Question(s):
- Do you have Expertise in the following - Terraform & Infrastructure as Code; Kubernetes (EKS / AKS) and container platforms; Experience with Linux;- Please Explain briefly.
Experience:
- AWS: 9 years (Required)
- Azure: 9 years (Required)
Shift availability:
- Night Shift (Required)
- Overnight Shift (Required)
Work Location: Remote