Harness is led by technologist and entrepreneur Jyoti Bansal, founder of AppDynamics (acquired by Cisco for $3.7B). The company has raised ~$240M in Series E venture funding, is valued at $5.5B, and backed by top investors including Goldman Sachs, Menlo Ventures, IVP, Google Ventures, J.P. Morgan, Capital One Ventures, Citi Ventures, ServiceNow, Splunk Ventures and more. Harness is building the industry’s leading AI-powered software delivery platform, enabling teams worldwide to build, test, and deliver software faster, safer, and more reliably. Writing code is only 30–40% of the engineering lifecycle — the rest involves testing, deployments, security, compliance, and optimization. Harness brings AI and automation to this outer loop, turning complex, time-consuming workflows into streamlined processes at massive global scale.
The platform includes industry leading products in CI/CD, Feature Flags, Cloud Cost Management, Service Reliability, Chaos Engineering, Software Engineering Insights, Internal Developer Experience, and API discovery, observability, governance, and runtime protection. Over the past year, Harness powered 128M deployments, 81M builds, 1.2T API calls protected, and $1.9B in cloud spend optimized, helping customers like United Airlines and Choice Hotels accelerate releases by up to 75% and achieve 10x DevOps efficiency. With employees in over 25 countries, Harness is shaping the future of AI-driven software delivery — and we’re looking for exceptional talent to help us move even faster.
Position Summary
As a Senior DevOps Engineer in the Customer Engineering organization at Harness, you will operate at the intersection of Forward Deployment Engineering and Cloud Engineering—enabling secure, production-grade deployments for customers adopting our Bring Your Own Cloud (BYOC) model. You will help customers run Harness and Traceable platforms within their own infrastructure while achieving SaaS-level reliability, scalability, and operational excellence.
In this role, you will design, build, and operate cloud and hybrid infrastructure across diverse customer environments (AWS, Azure, GCP), working closely with the Cloud Engineering team to align deployments with platform best practices. You will leverage modern CI/CD practices to streamline deployments, Infrastructure as Code (IaC) to ensure repeatability, and robust observability frameworks to maintain deep visibility into system health and performance. Applying strong Site Reliability Engineering (SRE) principles, you will ensure high availability, seamless upgrades, and proactive incident management.
Key Responsibilities
Cloud Infrastructure Design & Implementation:
- Design, build, and manage scalable, secure, and reliable cloud infrastructure using GCP, AWS or Azure.
- Develop infrastructure-as-code using tools such as Terraform, CloudFormation, or similar.
Site Reliability Engineering (SRE):
- Implement SRE practices to ensure the reliability, availability, and performance of cloud services.
- Develop and maintain monitoring, logging, and alerting systems to detect and address issues proactively.
- Perform capacity planning and demand forecasting to ensure system scalability and performance.
Automation & CI/CD:
- Deploy, manage, and scale applications using Kubernetes (K8s).
- Utilize Helm for packaging, deploying, and managing applications on Kubernetes.
- Design and implement continuous integration and continuous deployment (CI/CD) pipelines to automate the delivery of applications and infrastructure.
- Develop automation scripts and tools to streamline operations and improve efficiency.
Security & Compliance:
- Ensure cloud infrastructure and applications meet security and compliance standards.
- Implement security best practices and perform regular security audits and assessments.
Collaboration & Mentorship:
- Collaborate with cross-functional teams including developers, product managers, and operations to deliver high-quality solutions.
- Mentor and guide junior engineers, sharing best practices and fostering a culture of continuous improvement.
About You
Technical Expertise:
- Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent experience.
- 3+ years of experience in cloud engineering, site reliability engineering, or related roles.
- Strong experience with cloud platforms (AWS, GCP, Azure) and cloud-native services.
- Proficiency in infrastructure-as-code tools (Terraform), Helm package manager and configuration management tools (Ansible, Chef, Puppet)
- Experience with AI-OPS
SRE Practices:
- Experience with SRE principles, including error budgets, SLIs, SLOs, and incident management.
- Strong knowledge of monitoring and observability tools (Prometheus, Grafana, GCM).
Automation & DevOps:
- Expertise in building and managing CI/CD pipelines using tools like Jenkins, GitLab CI, CircleCI or Harness
- Strong coding skills (Python, Go, etc.) and familiarity with version control systems (Git).
Security & Compliance:
- Understanding of security best practices for cloud infrastructure and applications.
- Experience with compliance frameworks (ISO 27001, SOC 2, PCI DSS) is a plus.
Soft Skills:
- Excellent problem-solving skills and attention to detail.
- Strong communication and collaboration skills, with a proactive and innovative approach.
- This role will be out of our Bangalore office on a Hybrid capacity.