AWS Infrastructure & Architecture
Design, provision, and manage AWS infrastructure using Terraform, aligned with the AWS Well-Architected Framework
Application Load Balancer (ALB)
AWS Certificate Manager (ACM)
Kubernetes (EKS) Platform Operations
Own and operate EKS clusters end-to-end:
Managed node group lifecycle management
Karpenter-based autoscaling
Cluster add-on upgrades and lifecycle management
IRSA (IAM Roles for Service Accounts) configuration
Multi-AZ high availability and resilience
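As an illustration of the IRSA item above, the following is a minimal sketch of the IAM trust policy that lets a single Kubernetes service account assume a role through the cluster's OIDC provider. The function name, account ID, OIDC provider ID, namespace, and service account name are all hypothetical placeholders, not values from this role description.

```python
import json


def irsa_trust_policy(account_id: str, oidc_provider: str,
                      namespace: str, service_account: str) -> dict:
    """Build a trust policy scoped to one Kubernetes service account.

    oidc_provider is the cluster's OIDC issuer without the https:// prefix.
    """
    provider_arn = f"arn:aws:iam::{account_id}:oidc-provider/{oidc_provider}"
    subject = f"system:serviceaccount:{namespace}:{service_account}"
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                "Effect": "Allow",
                # The role trusts tokens federated through the cluster's OIDC provider.
                "Principal": {"Federated": provider_arn},
                "Action": "sts:AssumeRoleWithWebIdentity",
                "Condition": {
                    "StringEquals": {
                        # Restrict the role to exactly one service account.
                        f"{oidc_provider}:sub": subject,
                        f"{oidc_provider}:aud": "sts.amazonaws.com",
                    }
                },
            }
        ],
    }


if __name__ == "__main__":
    # All values below are made-up examples.
    policy = irsa_trust_policy(
        account_id="111122223333",
        oidc_provider="oidc.eks.us-east-1.amazonaws.com/id/EXAMPLE0123456789",
        namespace="agents",
        service_account="agent-runner",
    )
    print(json.dumps(policy, indent=2))
```

In practice this document would typically be rendered by Terraform rather than Python; the sketch only shows the shape of the policy and why the `sub` condition matters for least privilege.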
Build and maintain automated deployment pipelines using:
Enable multi-environment deployments:
Implement release strategies:
Integrate AWS-native security and governance controls:
External Secrets Operator
Enforce policy controls using:
OPA / Kyverno (admission controllers)
Observability & Monitoring
Implement and manage the observability stack:
Amazon Managed Prometheus
CloudWatch Container Insights
AWS X-Ray (distributed tracing)
Leverage AWS AI/ML services to support agent orchestration:
Amazon Bedrock (model inference, agent APIs)
Amazon SageMaker (model hosting, endpoints)
Amazon Comprehend (NLP, PII detection)
Cost Optimization (FinOps)
Implement cost-efficient architecture practices:
Karpenter bin-packing strategies
Scheduled scale-to-zero for non-production environments
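The scheduled scale-to-zero item above could be sketched as a small decision function that a scheduler calls to pick a replica count. The environment names, working-hours window, and function name are assumptions chosen for illustration; a real setup would more likely express the schedule in the autoscaler or a cron-driven job.

```python
from datetime import datetime

# Hypothetical set of environments that are allowed to scale to zero.
NON_PRODUCTION = {"dev", "staging"}

# Assumed working-hours window, in the scheduler's local time.
WORK_START_HOUR = 8
WORK_END_HOUR = 20


def desired_replicas(now: datetime, environment: str, normal_replicas: int) -> int:
    """Return the replica count an environment should run at the given time."""
    if environment not in NON_PRODUCTION:
        # Production is never scaled to zero by this schedule.
        return normal_replicas
    is_weekday = now.weekday() < 5  # Monday is 0, Sunday is 6
    in_working_hours = WORK_START_HOUR <= now.hour < WORK_END_HOUR
    return normal_replicas if (is_weekday and in_working_hours) else 0


if __name__ == "__main__":
    # A Wednesday evening: non-production is outside working hours.
    print(desired_replicas(datetime(2024, 1, 3, 22, 0), "dev", 3))
```

Keeping the decision in one pure function makes the schedule easy to test before wiring it to whatever actually changes node group or deployment sizes.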
Platform & Engineering Collaboration
Partner with platform and ML teams to:
Onboard new AI agent workloads
Integrate MCP servers and execution frameworks
Support extensibility of the agent ecosystem