Candidate Skills:
DevOps, SRE, CI/CD, Terraform, Ansible, Docker, Kubernetes, AWS/Azure/GCP, Monitoring & Logging (Prometheus/Grafana), Linux, Scripting (Python/Shell), Problem-solving & Analytical Skills
Job Description:
We are seeking a skilled DevOps / Site Reliability Engineer (SRE) to build, automate, and maintain scalable, reliable infrastructure. The role focuses on system reliability, performance, monitoring, and CI/CD automation to ensure high availability of applications and services.
Key Responsibilities:
- Design, implement, and manage CI/CD pipelines
- Automate infrastructure provisioning using IaC tools like Terraform/Ansible
- Monitor system performance, availability, and reliability
- Troubleshoot production issues and ensure quick resolution
- Manage cloud infrastructure across AWS, Azure, or GCP
- Implement logging, monitoring, and alerting systems
- Collaborate with development teams to improve system reliability and deployment processes
- Ensure security, scalability, and cost optimization of infrastructure
Required Skills:
DevOps, SRE, CI/CD, Terraform, Ansible, Docker, Kubernetes, AWS/Azure/GCP, Monitoring & Logging (Prometheus/Grafana), Linux, Scripting (Python/Shell), Problem-solving & Analytical Skills
Qualifications & Experience:
- 5+ years of experience in DevOps/SRE roles
- Strong expertise in cloud infrastructure, automation, and system reliability