Do you want to help build some of the largest and most consequential enterprise and customer technology systems in the world? Join Apple’s Information Systems and Technology (IS&T) organization. IS&T is the engine behind everything Apple does for customers and for the people who build for them. It’s Apple’s central nervous system. Supporting 2.5 billion active Apple devices, processing billions of secure transactions, and keeping the technology that defines modern life running flawlessly, IS&T makes the impossible feel effortless.
Do you love building solutions to handle global complexity and immense scale? Imagine what you could do here.
Customer Systems is part of IS&T and drives the technology behind Apple's customer support experience - from contact center operations to the software powering the iconic Genius Bar. The team also builds and operates AppleCare's online support platform, which handles 6 billion visits per year, delivering seamless, high-quality support to Apple customers around the globe.
Description
The Customer Systems Team is looking for an experienced Site Reliability Engineer. In this role you will design, build and deliver highly scalable, reliable, secure cloud infrastructure which powers the applications and services used by Apple’s customers every day. You will work closely with cross functional teams, business leaders and other partners across Apple to implement new solutions. If infrastructure as code, automation and intelligent monitoring excites you then this is the job for you.
Preferred Qualifications
Cloud architecture, building reliable, scalable, and secure Infrastructure as Code
Troubleshooting of application specific, network, system & performance issues in production during on-call rotations
Building automation tools to deliver infrastructure services reliably and in a repeatable fashion
Collaborating cross-functionally with distributed teams of software engineers, quality engineers, or other site reliability engineers to gather, analyze, and define non-functional/technical requirements and drive its implementation
Experience with Cassandra, MongoDB, Couchbase databases, AWS S3 or similar storage technologies
Experience deploying and supporting java applications
Deep understanding of networking protocols: DNS, TCP, HTTP/HTTPS
Excellent problem solving, critical thinking, and interpersonal skills
Experience in Linux Shell Scripting, Python, Terraform
BS or MS in Computer Science, or equivalent experience
Minimum Qualifications
5+ years experience in designing and building resilient, large-scale, low latency, cloud and on-prem Infrastructure including Compute, Storage, and Network
Deep expertise in building, deploying and managing Kubernetes clusters using Spinnaker and Helm
Experience in monitoring using Splunk or ELK stack, Grafana, Prometheus, Alertmanager
Experience in setting up and managing CI/CD pipeline using Jenkins