DL Computing Performance Architect

Posted:
7/28/2024, 5:00:00 PM

Location(s):
Shanghai, China

Experience Level(s):
Mid Level

Field(s):
AI & Machine Learning ⋅ Software Engineering

NVIDIA is developing processor and system architectures that accelerate machine learning, automotive and high-performance computing (HPC) applications. We are looking for a technical expert to lead our DL performance projection and analysis effort. This position offers the opportunity to make a meaningful impact in a fast-moving, technology-focused company.

What you'll be doing:

  • Establish DL applications and use-cases for analysis and projections.

  • Specify hardware/software configurations and metrics to analyze performance, power, accuracy and resiliency in uniprocessor and multiprocessor configurations.

  • Create and maintain workloads and micro-benchmark suites.

  • Generate projections, comparisons and analysis reports for internal/external consumption.

  • Collaborate across the company to guide the direction of next-gen deep learning HW/SW by working with architecture, software and product teams.

What we need to see:

  • 4+ years of work experience in a relevant industry.

  • Strong software skills with C/C++, Python, MPI, OpenMP, etc.

  • Experience with DL workload and operator optimization and performance analysis will be a plus.

  • Familiarity with GPU computing and parallel programming models will be a plus.

  • Excellent oral and written communication skills.

  • Good organizational, time management and task prioritization skills.