Senior Software Engineer, Quantized Training

Posted:
8/23/2024, 3:07:43 PM

Location(s):
California, United States ⋅ Washington, United States ⋅ Seattle, Washington, United States

Experience Level(s):
Senior

Field(s):
Software Engineering

We are now looking for a Senior Software Engineer for Quantized Training. We are a team committed to developing next-generation quantized training recipes for Hopper and future GPUs. We are seeking software engineers to help rethink and create tailored solutions to accelerate the discovery of new recipes. This is a coding-heavy role focused on building infrastructure, tooling, and visualizations.

The candidate's work directly supports NVIDIA's production SW systems including Megatron-LM and Transformer Engine. The candidate will be part of a core team of engineers and researchers working in lock step to improve quantized training convergence and efficiency.

What you'll be doing:

  • Create well-tested SW systems and PoCs in support of quantized training

  • Build visualization tools to track and assess the health of model training

  • Benchmark internal and external methods for quantized training

  • Build an insights platform for tracking model metrics and benchmarks

  • Architect CI/CD systems for versioning training recipes

  • Participate in code reviews

What we need to see:

  • A Masters Degree or PhD or meaningful equivalent experience in Computer Science/Computer Engineering or a related field.

  • 5+ years of relevant software development experience.

  • Strong software engineering background with a focus on building concise and well-tested code in C++ and Python

  • Experience working with ML accelerators and PyTorch or similar frameworks

  • Good foundation in ML training and quantization

  • Strong written and oral communication skills

Ways to stand out from the crowd:

  • Experience with CUDA, performance optimization and debugging

  • Proficient in precision and numerics for ML

GPU computing is the most productive and pervasive platform for deep learning and AI. It begins with the most advanced GPUs and the systems and software we build on top of them. We integrate and optimize every deep learning framework. We work with the major systems companies and every major cloud service provider to make GPUs available in data centers and in the cloud. We craft computers and software to bring AI to edge devices, such as self-driving cars and autonomous robots. AI has the potential to spur a wave of social progress unmatched since the industrial revolution.

Widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. Additionally, this opportunity offers you the ability to collaborate with some of the most forward-thinking and hard-working people in the world, shaping the future of AI in a creative and autonomous work environment that encourages innovation. Do you love the challenge of influencing the long-term opportunities that expand NVIDIA’s impact on the datacenter and beyond? If so, we want to hear from you!

The base salary range is 180,000 USD - 339,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

NVIDIA

Website: https://www.nvidia.com/

Headquarter Location: Santa Clara, California, United States

Employee Count: 10001+

Year Founded: 1993

IPO Status: Public

Last Funding Type: Grant

Industries: Artificial Intelligence (AI) ⋅ GPU ⋅ Hardware ⋅ Software ⋅ Virtual Reality