Lead Site Reliability Engineer

Posted:
2/23/2026, 12:37:23 AM

Location(s):
England, United Kingdom ⋅ Manchester, England, United Kingdom

Experience Level(s):
Senior

Field(s):
DevOps & Infrastructure ⋅ Software Engineering

Job Posting Title:

Lead Site Reliability Engineer

Req ID:

10143182

Job Description:

About the Role & Team 

 

We are seeking a subject matter expert in reliability engineering with experience leading engineering teams, driving technical innovation, and delivering high‑quality, high‑impact products against key reliability objectives. The ideal candidate will bring strong leadership and mentoring skills, a clear vision for building reliable systems, and the ability to guide technical solutions from concept through to delivery. 

 

As a Lead Site Reliability Engineer, you will set short‑ and long‑term goals to reduce toil through automation, improve incident management and observability, introduce tooling to identify and resolve critical reliability issues, and ensure high standards of documentation. The role is highly collaborative, with regular interaction across engineering teams, stakeholders, and leadership. 

You will join a dynamic, inclusive engineering organisation focused on continuous learning and improvement, working on reliability solutions that enable development teams to meet their service level objectives through ongoing measurement and optimisation. 

 

Within the Reliability Tooling team, you will write and review code, make key technical decisions, own roadmap delivery, and mentor Engineers and Senior Engineers. You will be a trusted go‑to expert on SRE principles and best practices, with a strong focus on SLIs and SLOs across large‑scale, user‑facing and internal services. 

Given our strong emphasis on innovation, the successful candidate will also be adaptable and comfortable pivoting as priorities evolve. 

 

 

Values 

You’ll join a team grounded in our Disney values — acting with Integrity, welcoming everyone through Inclusion, embracing boundless Creativity, working together through Collaboration and caring deeply for our Community. These values shape how we work and how we support one another every day. 

 

 

What You Will Do 

 

• Technical Decision Making 

• Structuring short- and long-term work for the team (Roadmapping) 

• Mentor and support team members with SRE best practices to ensure the team delivers to its stakeholders 

• Work closely with development teams and provide them with technical guidance to ensure new features have the proper operational support and maintainability 

• Responsible for interpreting the business domain into a digestible format for the Engineers 

• Develop software for the purposes of automating, monitoring and maintaining deployed infrastructure and services 

 Assist leadership with engineer reviews on a regular basis, working with them to develop and execute individual Career Development Plans and targets 

• Encourage and circulate Company culture amongst team members 

• Represent the Company at conferences and meetups 

 

Required Qualifications & Skills  

 

• Track record of working as a Lead Site Reliability Engineer 

• Understanding of SRE principles, patterns and best practices 

• 5+ Years experience in designing and implementing automation tools 

• 5+ Years Experience working with AWS Cloud Infrastructure and Resources, or equivalent large Cloud services provider 

• Proficient in either two of programming languages: Python, Golang, Rust 

• Experience with modern infrastructure services and concepts such as containerization, distributed systems, microservices, etc 

• Experience running and monitoring large scale distributed systems 

• Understanding of Software Engineering principles and patterns 

• Track record of working with Linux systems in production 

Preferred Qualifications: 

• Ability to understand the business domain from both a technical and business viewpoint 

• Experience working with distributed teams 

• Experience in an Operations role, providing support and troubleshooting technical issues 

 

 

The Perks  

 

  • 25 days annual leave 

  • Private medical insurance & dental care 

  • Free Park Entry: You will have the opportunity to enter any of our parks with your family and friends for free 

  • Disney Discounts: you are entitled to discounts on designated Disney products, resort F&B and ticketing 

  • Excellent parental and guardian leave 

  • Employee Resource Groups – WOMEN @ Disney, Disney DIVERSITY, Disney PRIDE, our new disability & neurodiversity focused group - ENABLED, and our Mental Health & Wellbeing Group, TRUST. 

 

The Walt Disney Company is an Equal Opportunity Employer. We strive to be a diverse workforce that is representative of our audiences, and where all can thrive and belong. Disney is committed to forming a team that includes and respects a variety of voices, identities, backgrounds, experiences and perspectives.  

We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation. 

 

 

Job Posting Segment:

PE - Sports, News & Entertainment, Enablement

Job Posting Primary Business:

PE - Sports, News & Entertainment, Enablement - Infrastructure Engineering

Primary Job Posting Category:

Site/System Reliability Engineer

Employment Type:

Full time

Primary City, State, Region, Postal Code:

Manchester, United Kingdom

Alternate City, State, Region, Postal Code:

Date Posted:

2026-02-23

The Walt Disney Company

Website: https://thewaltdisneycompany.com/

Headquarter Location: Burbank, California, United States

Employee Count: 10001+

Year Founded: 1923

IPO Status: Public

Last Funding Type: Post-IPO Debt

Industries: Amusement Park and Arcade ⋅ Animation ⋅ Consumer Goods ⋅ Digital Media ⋅ E-Commerce ⋅ Media and Entertainment ⋅ Multi-level Marketing ⋅ Performing Arts ⋅ Resorts