We are seeking a DevOps Engineer to design, implement, and maintain our infrastructure and CI/CD pipelines and to keep our applications deploying and running reliably. This role requires a strong understanding of cloud platforms, automation tools, and best practices in software development and operations. You will work closely with development and operations teams to foster a culture of collaboration and continuous improvement.
- Design, implement, and manage robust and scalable CI/CD pipelines using tools like Jenkins, GitLab CI, or GitHub Actions to automate software delivery.
- Provision, configure, and manage cloud infrastructure (AWS, Azure, or GCP) using Infrastructure as Code (IaC) tools such as Terraform or CloudFormation.
- Implement and maintain containerization technologies, primarily Docker and Kubernetes, for application deployment and orchestration.
- Develop and maintain automation scripts using Python, Bash, or Go to streamline operational processes and reduce manual intervention.
- Monitor system performance, troubleshoot issues, and implement fixes to keep applications and infrastructure highly available, reliable, and responsive.
- Collaborate with development teams to integrate new features and services into the existing infrastructure, ensuring scalability and security.
- Implement and manage logging, monitoring, and alerting solutions (e.g., Prometheus, Grafana, ELK Stack) to proactively identify and resolve operational issues.
- Ensure security best practices are integrated into the CI/CD pipelines and infrastructure, including vulnerability scanning and access management.
- Participate in on-call rotations to provide support for critical infrastructure and application issues.
- Document infrastructure, processes, and procedures to facilitate knowledge sharing and maintain operational consistency.
- Stay up to date with emerging technologies and industry trends to continuously improve our DevOps practices and tooling.
We are seeking an experienced Senior DevOps Engineer to design, implement, and manage scalable, secure, and highly available cloud infrastructure. The ideal candidate will have strong expertise in cloud platforms, Kubernetes, CI/CD automation, infrastructure as code, monitoring, security, and production operations.
This role requires a hands-on engineer who can drive automation, improve deployment reliability, optimize cloud costs, and support engineering teams throughout the software development lifecycle.
Key Responsibilities
Cloud Infrastructure
- Design, build, and maintain highly available cloud environments.
- Manage infrastructure across AWS, Azure, or GCP.
- Implement scalable and fault-tolerant architectures.
- Optimize cloud resource utilization and costs.
Kubernetes & Containerization
- Deploy and manage Kubernetes clusters in production environments.
- Configure autoscaling, ingress controllers, service meshes, and cluster security.
- Manage containerized workloads using Docker and Kubernetes.
- Troubleshoot cluster and application performance issues.
CI/CD & Automation
- Design and maintain CI/CD pipelines for automated build, test, and deployment processes.
- Implement GitOps practices using ArgoCD, FluxCD, or similar tools.
- Automate infrastructure provisioning and application deployments.
- Improve deployment frequency, reliability, and recovery times.
Infrastructure as Code
- Develop and maintain infrastructure using Terraform, CloudFormation, or Pulumi.
- Standardize infrastructure deployment processes.
- Implement reusable infrastructure modules and templates.
Monitoring & Observability
- Implement monitoring, logging, and alerting solutions.
- Manage tools such as Prometheus, Grafana, ELK, Loki, Datadog, or New Relic.
- Create dashboards and operational reports.
- Drive proactive incident detection and resolution.
Security & Compliance
- Implement DevSecOps best practices.
- Manage secrets, certificates, and access controls.
- Conduct vulnerability assessments and remediation.
- Ensure compliance with organizational security standards.
Production Support
- Participate in incident management and root cause analysis.
- Lead troubleshooting during critical production incidents.
- Improve platform reliability and operational excellence.
- Define and maintain SLAs, SLOs, and operational procedures.
Collaboration
- Work closely with Development, QA, Security, and Infrastructure teams.
- Mentor junior engineers and provide technical leadership.
- Drive DevOps best practices across the organization.
Cloud Platforms
- AWS (EC2, EKS, ECS, RDS, Lambda, S3, IAM, CloudWatch)
- Azure or Google Cloud Platform experience is a plus
CI/CD & GitOps Tools
- Jenkins
- GitHub Actions
- GitLab CI/CD
- Azure DevOps
- ArgoCD
Programming & Scripting
- Python
- Bash
- Shell Scripting
- Go (Preferred)