Senior Inference Performance Architect - Deep Learning

Posted:
9/20/2024, 7:58:00 AM

Location(s):
Oregon, United States ⋅ Hillsboro, Oregon, United States ⋅ Durham, North Carolina, United States ⋅ California, United States ⋅ North Carolina, United States

Experience Level(s):
Senior

Field(s):
AI & Machine Learning ⋅ Software Engineering

We are now looking for a Deep Learning Performance Analysis Architect! NVIDIA is seeking outstanding Performance Analysis Architects to help analyze and accelerate AI application performance at the intersection of both hardware and software. Intelligent machines powered by Artificial Intelligence that can learn, reason and interact with people are no longer science fiction. GPU Deep Learning has provided the foundation for machines to learn, perceive, reason and solve the world's most challenging problems. NVIDIA's GPUs excel at running AI algorithms, and act as the brains of computers, robots and self-driving cars that can perceive and understand the world.

What you’ll be doing:

  • Analyze performance and power efficiency of the most important deep learning inference workloads

  • Understand and analyze the interplay of hardware and software architectures on forward-looking algorithms, programming models and applications

  • Identify and prototype opportunities for performance optimization 

  • Actively collaborate with software, product and research teams to guide the direction of deep learning HW and SW 

What we need to see:

  • MS or PhD in Computer Science, Computer Engineering, Electrical Engineering or equivalent experience

  • 6+ years of relevant work/research experience

  • Solid foundation in machine learning and deep learning

  • Excellent programming skills in Python, C, C++

  • Strong background in computer architecture

  • Experience with performance modeling, architecture simulation, profiling, and analysis

  • A track record of creative solutions to technical challenges

Ways to stand out from the crowd:

  • CUDA programming skills

  • Background with deep neural network training, inference and optimization in leading frameworks (e.g. Pytorch, Tensorflow, TensorRT)

  • Experience with the architecture of or workload analysis on GPUs or other DL accelerators

Increasingly known as “the AI computing company”, NVIDIA wants you! Come join our Deep Learning Architecture team, where you can help build real-time, cost-effective computing platforms driving our success in this exciting and rapidly growing field.

#LI-Hybrid

The base salary range is 180,000 USD - 339,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

#deeplearning

NVIDIA

Website: https://www.nvidia.com/

Headquarter Location: Santa Clara, California, United States

Employee Count: 10001+

Year Founded: 1993

IPO Status: Public

Last Funding Type: Grant

Industries: Artificial Intelligence (AI) ⋅ GPU ⋅ Hardware ⋅ Software ⋅ Virtual Reality