We’re looking for a Senior/Lead DevOps Engineer who will own and evolve AWS and hybrid (on-prem Kafka cloud) infrastructure, ensuring stable, scalable, and cost-efficient operations. The role involves driving secure and automated delivery processes by improving CI/CD pipelines with quality gates, implementing Infrastructure-as-Code (Terraform/CloudFormation) and enhancing observability and incident response. The engineer will also proactively improve system reliability, performance, and security using modern AI-assisted DevOps tooling.
Requirements:
- Strong hands-on experience with AWS (EC2, Lambda, RDS, S3, networking, IAM);
- Solid CI/CD experience (GitLab CI/CD; GoCD or similar tools strongly preferred);
- Proven experience with Infrastructure-as-Code (Terraform and/or CloudFormation);
- Strong understanding of security best practices (least privilege, access control, secrets handling);
- Experience with monitoring and observability tools (logs, metrics, alerting);
- Experience supporting event-driven architectures (Kafka, SNS/SQS);
- Good Linux/system administration and troubleshooting skills;
- Practical experience using AI-assisted tools (e.g., Codex/Copilot/Claude, or similar) to accelerate DevOps workflows;
- Strong ownership mindset with ability to work autonomously and proactively;
- Excellent communication skills with proven track record of regular updates and transparency
Responsibilities:
- Own and evolve AWS infrastructure (EC2, Lambda, RDS, S3, SNS/SQS) ensuring stability, scalability, and cost-efficiency;
- Design, implement, and improve CI/CD pipelines (GitLab, GoCD) with proper quality gates (testing, linting, security checks);
- Standardize and improve environment setup (dev/staging/prod), enabling reliable and safe releases;
- Identify and remediate security gaps (IAM roles, access control, secrets management, compliance hygiene);
- Improve observability (monitoring, alerting, logging) and reduce incident response times;
Support and optimize hybrid architecture (on-prem Kafka
- Automate infrastructure and deployments using IaC (Terraform/CloudFormation);
- Actively propose and implement improvements (performance, reliability, cost, security);
- Ensure consistent communication: regular updates, early escalation of risks, clear reporting;
- Leverage AI tooling (e.g., Codex, AI-assisted DevOps tools) to improve automation, troubleshooting, and pipeline efficiency
Well-being:
- 15 working days of Paid Days Off within an individual year.
- Up to 5 working days of Unpaid Days Off within an individual year.
- 7 working days of Paid Sick Days Off
Professional Growth:
- Mentorship program – available on request.
- English courses and Speaking Club – attend English classes twice a week in small groups.
Added advantages:
- If you know someone you believe is a good fit for our cooperation, you can recommend them and get a reward.
- Public Holidays – celebrate 10 statutory holidays in India.
- Sombra events – Join Sombra’s traditional events (both online and offline).