Do you want to help build some of the largest and most consequential enterprise and customer technology systems in the world? Join Apple’s Information Systems and Technology (IS\u0026T) organization.\\nIS\u0026T is the engine behind everything Apple does for customers and for the people who build for them. It’s Apple’s central nervous system. Supporting 2.5 billion active Apple devices, processing billions of secure transactions, and keeping the technology that defines modern life running flawlessly, IS\u0026T makes the impossible feel effortless.”\\n\\nDo you love building solutions to handle global complexity and immense scale? Imagine what you could do here.\\n\\nInfrastructure Services is part of IS\u0026T and the foundation of Apple"s global network operations — managing data center equipment and systems to deliver compute, storage, and networking services for teams across Apple, including its internal developer community. From individual facilities to a worldwide network, Infrastructure Services ensures the technology underneath everything works without question.\\n
We are seeking a Senior DevOps Engineer with deep expertise in cloud infrastructure, Kubernetes, and platform operations, combined with a forward-looking mindset in AI-driven automation. This role partners closely with the Senior DevOps Engineering Manager to scale and modernize our cloud platform, improve operational excellence, and embed intelligent automation across DevOps workflows.\\n
Cloud Platform and Infrastructure: Design, build, and operate scalable, secure, and cost-efficient cloud environments on AWS. Lead cloud migration and modernization efforts, from VMs to containers to cloud-native architectures. Establish and enforce infrastructure standards, governance, and best practices. Drive improvements in availability, scalability, and performance.\\n\\nKubernetes and Platform Engineering: Architect, deploy, and operate large-scale Kubernetes platforms. Build and maintain multi-cluster and multi-tenant architectures. Improve developer experience through platform tooling and self-service capabilities. Optimize workloads for cost, performance, and reliability. Lead cluster lifecycle management, upgrades, and security hardening.\\n\\nAI-Driven Automation: Design and implement AI-powered automation across DevOps workflows, including incident triage, root cause analysis, and runbook automation. Build and integrate intelligent systems using LLM APIs and observability platforms. Identify automation opportunities across engineering teams and drive adoption. Evaluate and implement emerging AI/ML tools for operational use cases.\\n\\nSoftware Engineering (Golang): Develop internal tools, operators, and automation systems using Golang. Build APIs, controllers, and integrations for infrastructure and platform services. Contribute to reusable frameworks for automation and orchestration.\\n\\nOperations and Reliability Engineering: Own and improve production operations, including incident response and postmortems. Define and implement SLOs, SLIs, and error budgets. Enhance observability (metrics, logs, traces) using tools such as Prometheus and Grafana. Drive continuous improvement in operational processes and runbooks.\\n\\nLeadership and Collaboration: Act as a technical leader and mentor within the DevOps team. Partner with architects, developers, and product teams on system design and reliability. Influence roadmap decisions for cloud platform evolution. Help upskill team members in Kubernetes, cloud, and AI-driven DevOps practices.
Strong experience in DevOps, SRE, or Platform Engineering roles. \\nDeep hands-on experience with Kubernetes in production at scale. \\nStrong expertise in AWS (Azure or GCP experience also valued). \\nProficiency in Golang for building infrastructure and automation tools. \\nStrong understanding of CI/CD systems and pipeline design. \\nStrong understanding of distributed systems and microservices architectures. \\nProven experience in production operations and incident management. \\nExperience building or maintaining observability platforms.
10+ years experience in DevOps, SRE, or Platform Engineering roles.\\nExperience with AI/ML or LLM-based automation in DevOps workflows. \\nFamiliarity with orchestration frameworks such as LangChain or LangGraph. \\nExperience building Kubernetes operators or controllers. \\nBackground in platform engineering or internal developer platforms.