Tech Lead - Infrastructure Services N
Ready to turn bold ideas into real-world impact?
At Genpact, we don’t just adapt to change, we lead it. AI and digital innovation are transforming the way businesses work, and we’re at the forefront of it. Genpact’s AI Gigafactory, our industry-first accelerator, exemplifies how we scale advanced technology solutions to help global enterprises work smarter, grow faster, and transform at scale. From large-scale models to agentic AI, our breakthrough solutions tackle companies’ most complex challenges.
If you thrive in a fast-moving, innovation-driven environment, love building and deploying cutting-edge AI solutions, and want to push the boundaries of what’s possible, this is your moment.
Genpact (NYSE: G) is an agentic and advanced technology solutions company. We leverage process intelligence and artificial intelligence to deliver measurable outcomes. With a strong partner ecosystem and decades of client trust, we provide innovative solutions that transform how businesses run. Powered by a team with an active learning mindset and client centricity at its core, we deliver lasting value for the world’s leading enterprises.
Get to know us at genpact.com and on LinkedIn, YouTube, X, and Facebook.
Job Description
1. Observability Platform Engineering
- Design, implement, and maintain Datadog-based observability solutions across infrastructure, platforms, and applications.
- Develop and optimize dashboards, monitors, and alerts to support proactive detection and triage of performance and reliability issues.
- Integrate custom telemetry pipelines (metrics, logs, traces, events) aligned with OpenTelemetry and platform architecture standards.
- Manage instrumentation strategies to ensure accurate and consistent coverage across services.
2. Site Reliability Engineering (SRE) Practices
- Apply SRE principles to improve service reliability, availability, and performance.
- Define and track SLIs, SLOs, and SLAs for critical systems, and build feedback loops to continuously enhance service health.
- Automate manual operational processes using Python, Terraform, or CI/CD tooling.
- Collaborate with development and platform teams to identify resilience patterns and embed observability by design.
3. Datadog Expertise & Ecosystem Enablement
- Serve as the subject matter expert (SME) for Datadog, advising on advanced configurations, integrations, and performance optimization.
- Enable distributed tracing, APM, RUM, and synthetics capabilities to support end-to-end visibility.
- Implement and maintain Datadog Terraform configurations, templates, and governance models for enterprise consistency.
- Conduct performance tuning and cost optimization for Datadog usage across global environments.
4. Incident & Problem Management
- Partner with the Operations and Platform teams to analyze incident patterns and provide root cause insights through observability data.
- Lead post-incident reviews and recommend observability-driven improvements to prevent recurrence.
- Build automation and correlation mechanisms for real-time alert enrichment and contextual diagnostics.
5. Continuous Improvement, R&D, and Automation
- Proactively identify gaps, inefficiencies, and manual workflows within the observability ecosystem and design automation-first solutions.
- Research, prototype, and evaluate new observability patterns, tools, and techniques, including AI- and agent-based approaches, before scaling them into production.
- Build reusable frameworks, templates, and toolkits to reduce toil and enable self-service adoption across engineering teams.
- Continuously improve observability signal quality, alert precision, and operational efficiency through experimentation and iteration.
- Translate learnings from incidents, postmortems, and usage data into systemic improvements rather than one-off fixes.
Qualifications we seek in you!
Minimum Qualifications
- Bachelor’s degree in Computer Science, Information Systems, or a related field.
- Experience in observability engineering or SRE roles within large-scale distributed systems.
Preferred Qualifications/Skills
- Deep, hands-on expertise with Datadog, including APM, Logs, Metrics, RUM, and Synthetics.
- Strong proficiency in:
  - Infrastructure as Code (IaC): Terraform
  - Automation: Python, Bash, or similar scripting languages
  - CI/CD pipelines: Jenkins, GitLab, or GitHub Actions
- Experience supporting multi-cloud environments (AWS, GCP, Azure).
- Familiarity with container orchestration (Kubernetes, ECS) and service mesh observability.
- Understanding of data visualization and analytics for operational reporting.
- Exposure to AI-driven observability enhancements or integration with LLM-based insights (a plus).
- Certification in Datadog, AWS, or GCP is advantageous.
Qualifications
Bachelors - Computer Engineering, Bachelors - Database Management, Bachelors - Information Systems, Bachelors - Information Technology, Bachelors - Network Engineering
Certifications
Certified Information Systems Security Professional (CISSP) - Workforce Academy Online, Cisco Certified Network Professional (CCNP) - Netmetrix Solution
Required Skills
Agile Methodology, AWS DynamoDB, Datadog, GCP Dataflow, GitHub, Monitoring Tools, Terraform
Language
English (Required)
Language Proficiency
Proficient - C2
Job Type
Regular
Master Skill List
Infrastructure Services N
Remote Type
Hybrid
Work Shift
Flex Time (India)
Why join Genpact?
- Lead AI-powered transformation – Drive innovation and solve real-world business challenges
- Make an impact – Help global enterprises solve business challenges that matter
- Accelerate your career – Gain hands-on experience, mentorship, and world-class learning opportunities to stay ahead
- Work with the best – Join 140,000+ bold thinkers and problem-solvers who push boundaries every day
- Thrive in a values-driven culture – Our courage, curiosity, and incisiveness, built on a foundation of integrity and inclusion, allow your ideas to fuel progress
Come join the 140,000+ coders, tech shapers, and growth makers at Genpact and take your career in the only direction that matters: Up.
Let’s build tomorrow together.
Genpact is an Equal Opportunity Employer and considers applicants for all positions without regard to race, color, religion or belief, sex, age, national origin, citizenship status, marital status, military/veteran status, genetic information, sexual orientation, gender identity, physical or mental disability or any other characteristic protected by applicable laws. Genpact is committed to creating a dynamic work environment that values respect and integrity, customer focus, and innovation.
Please note that Genpact does not charge fees to process job applications, and applicants are not required to pay to participate in our hiring process in any way. Examples of scams to watch for include being asked to purchase a 'starter kit,' pay to apply, or purchase equipment or training.