Deep Learning Performance Architect

Posted:
6/2/2025, 5:00:00 PM

Location(s):
Shanghai, China

Experience Level(s):
Mid Level ⋅ Senior

Field(s):
AI & Machine Learning

NVIDIA is developing processor and system architectures that accelerate deep learning and high-performance computing applications. We are looking for an expert deep learning performance architect to join our AI performance modelling, analysis and optimization efforts. In this position, you will have a chance to work on DL performance modelling, analysis, and optimization on state-of-the-art hardware architectures for various LLM workloads. You will make your contributions to our dynamic technology focused company. 

What you'll be doing:

  • Analyze state-of-the-art DL networks (LLM etc.), identify and prototype performance opportunities to influence SW and Architecture team for NVIDIA's current and next gen inference products.

  • Develop analytical models for the state-of-the-art deep learning networks and algorithm to innovate processor and system architectures design for performance and efficiency.

  • Specify hardware/software configurations and metrics to analyze performance, power, and accuracy in existing and future uniprocessor and multiprocessor configurations.

  • Collaborate across the company to guide the direction of next-gen deep learning HW/SW by working with architecture, software, and product teams.

What we need to see:

  • BS, MS or PhD in relevant discipline (CS, EE, Math, etc.) or equivalent experience.

  • 5+ years’ work experience.

  • Experience with popular AI models (e.g., LLM and AIGC models)

  • Be familiar with typical deep learning SW framework (e.g., Torch/JAX/TensorFlow/TensorRT)

  • Knowledge and experience on hardware architectures for deep learning applications

Ways to stand out from the crowd:

  • Background with CUDA and GPU computing systems

  • Experience on performance modelling or optimization of DL workloads

NVIDIA

Website: https://www.nvidia.com/

Headquarter Location: Santa Clara, California, United States

Employee Count: 10001+

Year Founded: 1993

IPO Status: Public

Last Funding Type: Grant

Industries: Artificial Intelligence (AI) ⋅ GPU ⋅ Hardware ⋅ Software ⋅ Virtual Reality