2025 Summer Intern, MS/PhD, Scalable ML Training Infrastructure

Posted:
11/8/2024, 6:24:10 AM

Location(s):
Mountain View, California, United States ⋅ California, United States

Experience Level(s):
Internship

Field(s):
AI & Machine Learning ⋅ Software Engineering

Workplace Type:
Hybrid

Waymo is an autonomous driving technology company with the mission to be the most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver—The World's Most Experienced Driver™—to improve access to mobility while saving thousands of lives now lost to traffic crashes. The Waymo Driver powers Waymo One, a fully autonomous ride-hailing service, and can also be applied to a range of vehicle platforms and product use cases. The Waymo Driver has provided over one million rider-only trips, enabled by its experience autonomously driving tens of millions of miles on public roads and tens of billions in simulation across 13+ U.S. states.

Software Engineering builds the brains of Waymo's fully autonomous driving technology. Our software allows the Waymo Driver to perceive the world around it, make the right decision for every situation, and deliver people safely to their destinations. We think deeply and solve complex technical challenges in areas like robotics, perception, decision-making and deep learning, while collaborating with hardware and systems engineers. If you're a software engineer or researcher who's curious and passionate about Level 4 autonomous driving, we'd like to meet you.

Waymo interns work with leaders in the industry on projects that deliver significant impact to the company. We believe learning is a two-way street: applying your knowledge while providing you with opportunities to expand your skillset. Interns are an important part of our culture and our recruiting pipeline. Join us at Waymo for a fun and rewarding internship!

This internship will be based on-site at our headquarters in Mountain View, CA.

You will:

  • Scale distributed ML training frameworks to large clusters with thousands of accelerators
  • Build mathematical models and conduct real experiments to analyze performance bottlenecks
  • Improve distributed training efficiency by jointly optimizing communication and computation with cutting-edge technologies on ML runtime and compilers

You have:

  • Progressing towards MS or PhD in Computer Science or related technical field.
  • Python/C++ coding skills 
  • Familiarity with internals of ML frameworks (JAX, TensorFlow, PyTorch) and distributed training algorithms
  • Solid understanding of basics and algorithms of linear algebra, such as multi-dimensional MatMul and matrix calculus.
  • Knowledge of deep learning models and optimization

We prefer:

  • Familiarity using ML accelerators (GPU/TPU) with ML Compilers (TensorRT, XLA, etc.)
  • Prior work on cloud computing platforms (AWS, Azure, GCP)

Note: This will be a hybrid onsite internship position. We will accept resumes on a rolling basis until the role is filled. To be in consideration for multiple roles, you will need to apply to each one individually - please apply to the top 3 roles you are interested in.

The expected hourly rate for this full-time position is listed below. Interns are also eligible to participate in the Company’s generous benefits programs, subject to eligibility requirements.
Hourly Masters Pay
$50.48$50.48 USD
The expected hourly rate for this full-time position is listed below. Interns are also eligible to participate in the Company’s generous benefits programs, subject to eligibility requirements.
Hourly PhD Pay
$60.10$60.10 USD