Site Reliability Engineer

Posted:
11/19/2024, 5:53:53 AM

Location(s):
Masovian Voivodeship, Poland

Experience Level(s):
Junior ⋅ Mid Level

Field(s):
DevOps & Infrastructure ⋅ Software Engineering

Workplace Type:
Hybrid

Waymo is an autonomous driving technology company with the mission to be the most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver—The World's Most Experienced Driver™—to improve access to mobility while saving thousands of lives now lost to traffic crashes. The Waymo Driver powers Waymo One, a fully autonomous ride-hailing service, and can also be applied to a range of vehicle platforms and product use cases. The Waymo Driver has provided over one million rider-only trips, enabled by its experience autonomously driving tens of millions of miles on public roads and tens of billions in simulation across 13+ U.S. states.

The Software Reliability Engineering (SRE) team at Waymo is in charge of overall reliability for Waymo's internal and external services – including our Waymo One ride hailing service. This work requires engineering skills, focused on architectural resiliency, optimizing safety and velocity, and automating as much as possible. As with most SRE teams, our team is Waymo is on-call, responding to the most urgent issues at Waymo. It is vital for SRE to perform our role so that the Waymo Driver can be available and in the hands of as many customers as possible.

In this hybrid role, you will report to a Senior Engineering Manager.

 

You will:

  • Audit and redesign key architectural flows like data reloads, experimentation, change management, and capacity management
  • Work closely with experts to design reliability and efficiency projects for massive systems
  • Build services, systems and tooling to increase automation, reliability and scalability of Waymo
  • Be on-call for Waymo's most important software systems, and also act as Incident Coordinators for resolving any potential incidents
  • Own areas of Waymo's production landscape, becoming the go-to person for the continual operation of a service or umbrella of work

 

You have:

  • B.S. or M.S. in Computer Science or related technical field or equivalent practical experience
  • 2+ years experience as a reliability engineer, or 4+ years working with production systems
  • Excited about working on self-driving cars, and excited about reliability!
  • Ability to work outside standard hours, including beyond 5 pm CET, required due to close collaboration with US-based engineering teams

 

We prefer:

  • Coding experience in C++
  • Experience with NALSD
  • Background in reliability engineering for a customer-facing product

#LI-Hybrid