Job Description:
We are looking for a Senior DevOps Engineer with 5+ years of experience who can take ownership of large-scale cloud infrastructure, design highly resilient and cost-optimized architectures, and guide the evolution of DevOps culture and practices within the organization.
As a senior member of the DevOps team, you'll be a key technical leader driving cloud strategy, automation, observability, and security. You'll work closely with stake holders, developers, QA, and Product owners to deliver reliable, scalable, and secure solutions—while mentoring and upskilling the next generation of DevOps engineers at Mallow.
Responsibilities
- Solution Architect, implement, and maintain scalable, secure, and highly available AWS cloud infrastructure across multiple environments.
- Design and manage CI/CD pipelines to support reliable, repeatable deployments with zero-downtime strategies (blue/green, canary, rolling).
- Deploy and orchestrate containerized applications using AWS ECS (Fargate/EC2), Kubernetes (EKS/self-hosted), and Docker.
- Develop and maintain Infrastructure as Code (IaC) using tools like Terraform, CloudFormation, and Ansible to automate infrastructure provisioning and configuration.
- Implement and enhance monitoring, logging, and alerting systems using CloudWatch, Prometheus, Grafana, ELK stack, and OpenTelemetry.
- Optimize infrastructure for performance, scalability, and cost-efficiency, regularly reviewing usage and recommending improvements.
- Ensure infrastructure and application security best practices, including IAM, VPC security, secret management (AWS SSM/Vault), and compliance with internal standards.
- Troubleshoot complex infrastructure and networking issues, leading root cause analysis and resolution of production incidents.
- Collaborate with cross-functional teams including backend, frontend, QA, and product to streamline development and deployment workflows.
- Mentor and support junior and mid-level DevOps engineers, conduct code and infrastructure reviews, and contribute to internal documentation and knowledge sharing.
Requirements
- 5+ years of hands-on experience in DevOps, Site Reliability Engineering, or Cloud Infrastructure roles.
- Proven expertise in AWS Cloud Services, including ECS (Fargate/EC2), EKS/Kubernetes, Lambda, API Gateway, S3, DynamoDB, and VPC.
- Strong experience with containerization and orchestration using Docker and Kubernetes (managed or self-hosted).
- Deep knowledge of Infrastructure as Code (IaC) using Terraform, CloudFormation, or Ansible.
- Proficiency in CI/CD pipeline design and implementation using tools like GitLab CI/CD, Jenkins, AWS CodePipeline, or ArgoCD.
- Solid scripting skills in Python, Bash, or similar languages for automation and tooling.
- In-depth understanding of cloud networking, security best practices, IAM policies, and secrets management (SSM, KMS, Vault).
- Experience implementing observability solutions (monitoring, logging, alerting) with tools like CloudWatch, ELK Stack, Prometheus, Grafana, or OpenTelemetry.
- Demonstrated ability to troubleshoot and resolve complex infrastructure, deployment, and networking issues.
- Strong communication and collaboration skills; able to work closely with developers, QA, and leadership.
- AWS Certification (e.g., DevOps Engineer – Professional, Solutions Architect – Professional) is a strong plus.
Pay: ₹1,000,000.00 - ₹2,000,000.00 per year
Benefits:
- Internet reimbursement
- Provident Fund
Work Location: In person