We are seeking a highly skilled and experienced Contractor - Senior Specialist Cloud SRE with a strong focus on AWS to join our dynamic team. As a Senior Specialist Cloud SRE, you will be instrumental in ensuring the reliability, scalability, and performance of our cloud-native applications and infrastructure on Amazon Web Services (AWS). You will apply Site Reliability Engineering (SRE) principles to build and maintain robust, automated systems and to drive operational excellence. This role requires a deep understanding of AWS services, infrastructure as code, monitoring, and incident response, as well as a passion for automation.
- Design, implement, and maintain highly available, scalable, and resilient cloud infrastructure and applications on AWS.
- Develop and implement automation tools and frameworks to streamline operational processes, deployments, and infrastructure provisioning using Infrastructure as Code (IaC) tools (e.g., AWS CloudFormation, Terraform).
- Monitor system performance, availability, and capacity using advanced monitoring and alerting tools (e.g., CloudWatch, Prometheus, Grafana).
- Proactively identify and resolve complex technical issues, performance bottlenecks, and potential points of failure within the AWS environment.
- Lead incident response efforts, conduct root cause analysis (RCA), and implement preventative measures to reduce the likelihood of recurrence.
- Define, track, and improve Service Level Objectives (SLOs), Service Level Indicators (SLIs), and Error Budgets for critical services.
- Collaborate with development teams to ensure new features and services are designed for operational readiness, reliability, and observability.
- Implement and enforce security best practices and compliance standards across all AWS environments.
- Participate in on-call rotations to provide 24/7 support for critical production systems.
- Mentor junior SREs and contribute to knowledge sharing within the team.
- Evaluate and recommend new technologies and tools to improve the SRE practice and overall system reliability.