Working at Citi is far more than just a job. A career with us means joining a team of more than 230,000 dedicated people from around the globe. At Citi, you’ll have the opportunity to grow your career, give back to your community and make a real impact.
The SRE Observability Specialist is a hands-on expert, delivering the future of Observability across Services Technology. This role is part of a central SRE enablement team within Services Production and works closely with SREs, developers, and platform teams to embed telemetry, implement SLOs, and build meaningful visualizations for key production flows, particularly in the critical Payments business.
The ideal candidate will have deep technical knowledge, a collaborative mindset, and the ability to translate strategy into scalable engineering outcomes. You will also act as a bridge between Services Technology teams and central infrastructure/CTO teams, prioritising observability needs from line-of-business teams and driving improvements. A strong understanding of observability tooling, evolving AI/ML capabilities, and enterprise tooling ecosystems will be essential.
This role provides technology support for Project Orion, a function that delivers end-to-end payment monitoring. The work includes building an end-to-end payments dashboard, reducing toil, and transforming legacy monitoring into an observability-based monitoring solution. It requires a good understanding of the payments taxonomy (ACH, Wires, Instant Payments, etc.). Strong commercial awareness, technical credibility, and excellent communication skills are essential to negotiate internally, influence peers, and drive change. Some external communication may be necessary.
Key Responsibilities:
Qualifications:
- 7+ years of experience in SRE, Observability Engineering, or platform infrastructure roles focused on operational telemetry.
- Hands-on experience with observability tools and stacks such as Grafana, Prometheus, OpenTelemetry, ELK, Splunk, and similar platforms.
- Deep understanding of SLIs, SLOs, Error Budgets, and telemetry best practices in high-availability environments.
- Proven ability to troubleshoot integration issues and support observability across hybrid platforms (on-prem, cloud, containers).
- Experience building dashboards aligned to business outcomes and incident workflows, especially in critical flows like payments.
- Familiarity with modern observability tooling ecosystems, including AI/ML capabilities, trace correlation, baselining, and alert tuning.
- Strong interpersonal and collaboration skills — able to operate across federated engineering teams and central infrastructure groups.
- Experience in enablement or platform teams with a track record of scaling best practices across diverse business units.
Education:
- Bachelor’s degree in Computer Science, Engineering, or a related technical field, or equivalent practical experience.
Job Family Group: Technology
Job Family: Applications Support
Time Type: Full time
Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.