DevOps Engineer
Role Summary
We are looking for a DevOps engineer to build and operate cloud-native foundations and to partner closely with AI and application engineers. This is a hands-on role spanning DevOps fundamentals — Kubernetes, CI/CD, GitOps, and infrastructure-as-code — and AI/ML infrastructure such as GPU-backed model serving and the data services that AI applications depend on.
You will help stand up pragmatic, right-sized platform foundations and act as the infrastructure partner to application engineers — deploying and scaling their services reliably, keeping the developer loop fast, and building security and observability in from the start. We are looking for a candidate with a strong academic record and solid computer-science fundamentals (data structures, algorithms, and problem-solving) alongside hands-on platform experience.
Key Responsibilities
DevOps Foundations
- Bootstrap and operate Kubernetes (AKS) plus the surrounding cloud-native foundations: container registry, secrets management, identity, and monitoring.
- Own the build-and-deploy path end to end — containerisation (Docker/BuildKit), image registry, CI/CD (GitHub Actions or Azure Pipelines), and GitOps (Argo CD or Flux).
- Codify infrastructure with Terraform and/or Bicep and Helm so environments are reproducible and reviewable.
AI Infrastructure & Data Plane
- Package and deploy application and AI/agent services, including the runtime, secrets, and configuration they require.
- Provision and operate the data plane: managed PostgreSQL, Redis, a vector store, a graph store, object storage, and a queue / event bus.
- Stand up self-hosted inference — e.g. vLLM on a GPU node pool — alongside hosted-model connectors; use KEDA/HPA to scale stateless pods (down to zero) and bin-pack GPU efficiently.
Isolation, Security & Reliability
- Implement tenant/workload isolation: namespaces, deny-by-default NetworkPolicy, ResourceQuota/LimitRange, and scoped secrets.
- Manage secrets and identity with a secrets manager / KMS and an identity provider; keep credentials out of code.
- Establish observability and basic SRE practice: OpenTelemetry, Prometheus + Grafana (or cloud-native monitoring), tracing, alerting, and runbooks.
Collaboration & Cost
- Pair with application engineers on inference, persistence, and durable-execution concerns.
- Keep the platform cost-aware — right-size nodes, scale to zero, and surface spend.
- Contribute to engineering standards and a fast, low-friction developer loop.
Required Technical Skills
Domain
Skills & Technologies
Must / Preferred
CS Fundamentals & DSA
Data structures, algorithms, complexity analysis, strong problem-solving
Must
Kubernetes
AKS — deployments, services, autoscaling, ingress, namespace isolation
Must
DevOps Fundamentals
Docker, CI/CD (GitHub Actions or Azure Pipelines), Git workflows
Must
Infrastructure as Code
Terraform (and/or Bicep), Helm
Must
Cloud
Azure (AKS, ACR, Key Vault, Entra ID, Monitor); managed PostgreSQL/Redis/Blob/Service Bus
Must
GitOps
Argo CD or Flux
Must
Observability
OpenTelemetry, Prometheus, Grafana (or cloud-native monitoring)
Must
Scripting
Python and/or Bash for glue, automation, and debugging app services
Must
GPU / ML Infra
GPU scheduling, serving LLMs with vLLM, GPU bin-packing & cost optimisation
Preferred
Autoscaling
KEDA event-driven autoscaling, scale-to-zero
Preferred
App Runtimes
Deploying Python application / AI services (e.g. FastAPI, LangGraph), Temporal
Preferred
Mesh / Gateway / Secrets
Istio, Kong / API gateways, HashiCorp Vault
Preferred
Sandboxing & Supply Chain
gVisor / Kata Containers; Sigstore/cosign, SBOM (Syft), SLSA
Preferred
Qualifications & Certifications
- Strong academic record — B.Tech / B.E. / M.Tech / MCA in Computer Science or a related field from a reputable institution (or equivalent).
- 2–3 years of hands-on experience in DevOps / platform / infrastructure engineering, with practical Kubernetes exposure.
- Strong data structures, algorithms, and problem-solving skills.
- Experience supporting application engineers (deploying services, debugging runtime issues) and working with cloud infrastructure; exposure to production clusters is a plus.
Preferred Certifications
- Certified Kubernetes Administrator (CKA) / CKAD
- Microsoft Certified: Azure Administrator / Azure Solutions Architect
- HashiCorp Terraform Associate
Soft Skills & Cultural Fit
- Pragmatic and outcome-driven — builds the minimum viable platform first, avoids premature complexity.
- Collaborative; genuinely enjoys pairing with application engineers and unblocking their deployment/runtime problems.
- Strong ownership and on-call maturity; calm, methodical incident response.
- Clear communicator across technical and non-technical stakeholders.
What We Offer
- Hands-on work across modern cloud-native and AI infrastructure — Kubernetes, GPU inference, GitOps, and durable workflows.
- High ownership of the platform and infrastructure from an early stage.
- Competitive compensation with a structured performance review process.
- Professional development support — certifications (CKA, Azure), conferences, and emerging tooling.
- Collaborative, transparent culture with clear growth pathways toward Staff / Principal platform engineering.
About Softobiz Technologies
Softobiz Technologies is a technology and product services company headquartered in India, operating Global Capability Centers (GCCs) for leading international clients across healthcare, fintech, and enterprise software. Our GCC model enables world-class talent in India to work directly within the product and engineering teams of our global partners, contributing meaningfully to product strategy, growth, and operations
Innovation begins with like-minded people aiming to transform the world together. At Softobiz, we invite you to become a part of an organization that has been helping clients transform their business by fusing insights, creativity, and technology. With a team of 400+ technology enthusiasts, we have been trusted by leading enterprises around the globe for over 18+ years.
At Softobiz, we foster a culture of equality, learning, collaboration, and creative freedom, empowering our employees to grow and excel in their careers. Our technical craftsmen are pioneers in the latest technologies like AI, machine learning, and product development.
Why Should You Join Softobiz?
- Work with technical craftsmen who are pioneers in the latest technologies.
- Access training sessions and skill-enhancement courses for personal and professional growth.
- Be rewarded for exceptional performance and celebrate success through engaging parties.
- Experience a culture that embraces diversity and creates an inclusive environment for all employees.
Softobiz is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. All qualified applicants will be afforded equal employment opportunities without discrimination based on race, creed, color, national origin, sex, age, disability, or marital status.