We are seeking an experienced engineer with a strong focus on reliability, scalability, and automation to assist in developing and maintaining advanced analytics platforms on Google Cloud.
· Extensive hands-on experience with Google Cloud products, particularly within analytics or large-scale infrastructure settings
· Comprehensive knowledge of Site Reliability Engineering (SRE) principles, including SLIs/SLOs, error budgets, and incident management processes
· Proficiency in Infrastructure as Code practices (e.g., Terraform, Deployment Manager) and CI/CD pipeline development
· Expertise with observability tools such as Dynatrace and Cloud Monitoring
· Solid foundation in Docker, Linux, networking, and cloud security best practices
· Demonstrated experience working in DevOps environments with a focus on automation and resiliency
· Familiarity with container orchestration technologies (including Kubernetes and GKE) as well as serverless architectures
· solid knowledge of the Google cloud platform with specific experience in integrating and deploying the Vertex AI components
· Experience utilizing big data and analytics tools, such as BigQuery and Pub/Sub
· Foundational understanding of Python
· Google Cloud Platform certification is preferred