Innovation Hub Overview
Jefferies is creating a Technology Innovation Hub in Pune, a greenfield opportunity to build the systems that power global markets. As our first India technology center, this hub brings together hands on builders who engineer the platforms behind Jefferies’ growth across capital markets, investment banking, and institutional securities. We’re scaling toward an elite team of 500 engineers while maintaining the agility, ownership, and meritocratic spirit that defines Jefferies. From cloud and data to AI, risk, and core business technologies, teams in Pune will lead high impact work with a global mandate.
IT Infrastructure Technology
Jefferies’ IT Infrastructure Technology team builds and runs the core technology backbone that enables the firm to operate globally. It covers enterprise infrastructure engineering and operations while driving modernization initiatives such as Cloud Adoption and Network Resiliency. The group supports critical platforms including networking, end-user computing, cloud, databases, server/Unix engineering, communications, and global data centers (including low-latency colocations near exchanges).
Role summary
Own reliability and operational excellence for Public Cloud and Kubernetes platforms in AWS (EKS). You will drive automation with Terraform and GitOps pipelines via Argo CD and other automation tooling and improve observability and incident response maturity. This Lead Kubernetes and Cloud SRE Engineer will be a hands-on technical leader with deep AWS SRE experience, proven cloud architecture skills, and strong scripting ability (Python or Go). You will operate and evolve our AWS EKS-based Kubernetes platform, drive automation via Terraform and Argo CD, and partner closely with Cloud Engineering, Cloud Architecture, Cloud Platform, and Cloud Networking teams. The platform may expand to additional cloud providers in the future.
Key responsibilities
- Operate and improve Kubernetes platforms on AWS EKS as an admin: cluster lifecycle support, upgrades, scaling, capacity planning, and standardization.
- Work closely with Cloud Architecture, Cloud Platform, Cloud Networking, and Security to resolve systemic issues across identity, networking, DNS, ingress, certificates, and platform guardrails.
- Participate in a 24x7 on-call rotation (local timezone), typically 12-hour shifts for one week at a time, every 12 weeks; lead incident triage, coordinate responders, and drive restoration.
- Provide architectural input to ensure services are production-ready (fault tolerance, blast-radius reduction, safe deployments, runbook readiness, and clear SLOs). Influence reference architectures and “golden paths” for workload onboarding.
- Build and maintain IaC with Terraform (modules, state workflows, environments); Terraform Enterprise experience is a strong plus.
- Implement and operate GitOps deployment patterns using CI/CD tools like Argo CD, Flux, Jenkins (access controls, environment promotion, drift detection, app patterns).
- Improve observability: actionable alerting, dashboards, SLO/SLI reporting, and reduction of noise/toil.
- Troubleshoot complex issues spanning Kubernetes and AWS infrastructure (ingress, DNS, IAM/RBAC, node lifecycle, autoscaling, storage classes, cluster performance).
- Drive post-incident reviews and ensure corrective actions are delivered and tracked to completion with blameless retrospectives and actionable solutions
- Mentor junior engineers and contribute to team standards, documentation, and operational readiness.
- Required skills / experience
Requirements
- 7-10+ years in SRE/platform/DevOps with production ownership.
- Strong Kubernetes operations knowledge (networking basics, ingress/controllers, autoscaling, RBAC, storage, troubleshooting)
- Strong Cloud (preferred AWS) experience and penchant for multi-cloud interest: A Cloud certification in Associate or Professional levels is preferred (AWS Cloud Solutions Architect Associate/Professional, Azure Administrator Associate/Expert)
- Demonstrated cloud architecture experience designing resilient, secure, and operable systems (availability, DR patterns, scaling, and cost-aware design).
- Proficiency in Python or Go (plus shell scripting) with a track record of automation and toil reduction.
- Strong experience with Terraform (modules, state, environment patterns); CI/CD/Git workflows.
Bonus (highly valued)
CKAD certification
Terraform Enterprise (workspaces, guardrails/policies, team workflows)
AKS/GKE exposure (for potential future multi-cloud expansion)
Agentic AI experience in an operational capacity
We have been made aware of bad actors falsely claiming to be associated with Jefferies Group soliciting individuals to attend virtual job interviews, complete online tests or courses and sending fictitious employment offer letters.
Please note that any email contact with Jefferies personnel will come from an “@jefferies.com” email address. Further, Jefferies will not notify shortlisted candidates through social media platforms (e.g. WhatsApp or Telegram) or ask candidates to make payment to participate in the hiring process.
#LI-MF1