The candidate should be able to work with Global Operations and Core Practices teams to operate, industrialize and strengthen operational practices, and should have proven experience in handling large-scale, growing infrastructure across data centers and heterogeneous cloud platforms.
The candidate should be a team player with good communication and problem-solving skills, with prior experience of DevOps practices and of working in a global operations model (follow the sun), and should be able to lead tasks independently and mentor junior team members.
MUST HAVE
- Strong experience in container orchestration and server automation tools such as Kubernetes, Docker, AWS EKS, Ansible & Terraform
- Experience in deploying and managing highly scalable, fault-resilient systems
- Infrastructure as Code (IaC) using Terraform
- CI/CD pipeline automation using Jenkins/GitLab CI/Travis
- Scripting and automation using Shell, Python and Groovy scripts
- Strong knowledge and experience of AWS services (6+ years of experience):
  - Compute services (EC2 creation)
  - AWS key pair creation
  - Route 53
  - Storage / IAM
  - VPN setup
  - ELB creation
  - CloudWatch, CloudTrail
  - CloudFormation
- In-depth knowledge of monitoring tools such as Icinga and Prometheus
- Able to design, maintain and support:
  - DR and failover architectures
  - OS/application patching and upgrades
- Incident management experience using runbooks, and troubleshooting
GOOD TO HAVE
- Ticketing tools such as ServiceNow and Jira
- ITIL certification, AWS Certification, Kubernetes Administrator
- Linux administration and command-line knowledge
- Networking - Virtual Network, DNS, IPs, Security Concepts (like ACLs, firewalls etc.)
- Familiarity with various network protocols, e.g. HTTP/FTP/SFTP/SMTP
- Knowledge of SSL, SSH, LDAP/firewalls, certificates
Total Experience Expected: 6-8 years