Key Responsibilities
- Design, deploy, and manage cloud infrastructure on AWS, ensuring scalability, security, and cost
efficiency.
- Containerize applications using Docker and orchestrate workloads at scale with Kubernetes (EKS
preferred).
- Build and maintain CI/CD pipelines to enable fast, reliable, and automated software delivery.
- Implement Infrastructure as Code (IaC) using tools such as Terraform, CloudFormation, or
Pulumi.
- Set up robust monitoring, logging, and alerting using tools like Prometheus, Grafana,
CloudWatch, ELK, or Datadog.
- Drive observability, incident response, root cause analysis, and continuous reliability
improvements.
- Harden infrastructure security: IAM, secrets management, network policies, vulnerability
scanning, and compliance controls.
- Collaborate with developers to optimize application performance, deployment workflows, and
developer experience.
- Automate operational tasks using scripting (Bash, Python, or Go) to eliminate toil.
- Document architecture, runbooks, and operational best practices.
Must-Have Qualifications
- 5+ years of professional experience as a DevOps Engineer, SRE, or in a similar infrastructure
role.
- AWS: Strong hands-on experience with core services (EC2, VPC, S3, IAM, RDS, EKS, Lambda,
CloudWatch, Route 53).
- Docker: Solid expertise in containerization, writing efficient Dockerfiles, and managing container
lifecycles.
- Kubernetes: Production experience deploying and operating clusters, including Helm, ingress,
autoscaling, and troubleshooting.
- CI/CD: Experience building pipelines with tools like GitHub Actions, GitLab CI, Jenkins, or
ArgoCD.
- Linux & Scripting: Strong Linux administration skills and proficiency in Bash, Python, or Go.
- IaC: Practical experience with Terraform or equivalent infrastructure-as-code tooling.
- Monitoring & Observability: Experience with metrics, logs, and traces in production
environments.
Good-to-Have Qualifications
- Founding DevOps Engineer experience: Prior experience as the first or founding DevOps hire
at a startup, building infrastructure and DevOps culture from scratch.
- Medical / Healthcare data systems: Exposure to medical data systems, healthcare platforms, or
working with regulated data (HIPAA, HL7, FHIR, DICOM, or similar standards).
- Compliance & Security: Familiarity with SOC 2, HIPAA, ISO 27001, or other compliance
frameworks.
- Service Mesh & Advanced K8s: Experience with Istio, Linkerd, or operators and custom
controllers.
- Multi-cloud or Hybrid: Exposure to GCP or Azure in addition to AWS.
- Data & ML infra: Experience supporting data pipelines, model serving, or GPU workloads.