Deltek is looking for a Senior Software Engineer to join our Site Reliability Engineering team. In this role, you will be responsible for the reliability, scalability, and performance of our globally used SaaS platforms. You will bridge the gap between software engineering and infrastructure operations, building the tools, automation, and systems that keep our products running for thousands of customers and millions of users.
This is a high-ownership role in a "never-stop-learning" environment. You will work closely with development teams to embed reliability practices early in the software lifecycle, respond to production incidents, and drive continuous improvements to our observability and operational posture.
Key Responsibilities:
Site Reliability & Platform Engineering
Observability & Performance
Design and maintain comprehensive observability solutions, including logging, metrics, tracing, and alerting, across our AWS-based infrastructure.
Incident Management & On-Call Support
Own post-incident reviews, facilitate blameless post-mortems, identify root causes, and ensure action items are tracked and completed.
Collaboration & Engineering Culture
Technology Stack: