Technical Product Manager - Model Optimization

Posted:
10/2/2024, 3:57:20 AM

Location(s):
California, United States

Experience Level(s):
Mid Level

Field(s):
Product

Are you interested in bringing Artificial Intelligence to more applications? AI models have a rapidly growing demand in unique use cases. From Autonomous Vehicles to Generative AI assistants, and medical devices to shopping recommendations, each application has its own unique constraints and deployment requirements. One of the most effective ways to adapt to each of these varying needs is by optimizing the AI model itself; using quantization and sparsity to reduce memory footprint, neural architecture search to ensure the model is optimal for the task at head, distillation to significantly reduce model size & improve performance, and much more. We are looking for a Product Manager who understands these deep learning techniques and desires to drive the ecosystem forward with more advancements.

At NVIDIA, we are crafting solutions to optimize AI models to support all of these use cases, as well as the ones that come next! As a Product Manager for model optimization, you are the customer champion inside NVIDIA for developers working to deploy AI models in constrained environments. As Product Managers, we work directly with developers inside and outside of NVIDIA to identify new optimizations & features, define a roadmap for when they can be available, and stay alert on alternative solutions. In addition, these products are new to the space and will require a go-to-market strategy & clear product direction. The Product Management organization at NVIDIA is a small, strong, and impactful group. Our team focuses on enabling deep learning across all GPU use cases and providing great solutions for developers. We are seeking a rare blend of product skills, technical depth, and passion for creating new technology. If that's you, we would love to hear from you!

What you’ll be doing:

  • Develop product strategy and go-to-market plans

  • Collaborate with internal and external deep learning engineers and researchers to build product-based roadmaps for model optimization software

  • Help channel customer usability feedback from the external community back to the internal teams to improve NVIDIA products

  • Working with NVIDIA leadership to align with and drive company strategy

  • Be the champion in MLOps developer community to promote our platform

What we need to see:

  • BS or MS degree in Computer Science, Computer Engineering, or similar field or equivalent experience

  • 3+ years of technical product management, or similar, experience at a technology company

  • Strong communication and interpersonal skills

  • Proven knowledge of deep learning or machine learning concepts, and software development and delivery

Ways to stand out from the crowd:

  • Experience working on emerging products and bringing them to market

  • Understanding of deep learning model optimization algorithms, like quantization, sparsity, adapters, and NAS

  • Familiarity with working with deep learning frameworks, like PyTorch, JAX, & TensorFlow, and Deep Learning Compilers, such as TensorRT, XLA, Inductor, and TVM

  • Knowledge of GPU architecture, HW/SW co-design, and performance profiling

The base salary range is 108,000 USD - 218,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

NVIDIA

Website: https://www.nvidia.com/

Headquarter Location: Santa Clara, California, United States

Employee Count: 10001+

Year Founded: 1993

IPO Status: Public

Last Funding Type: Grant

Industries: Artificial Intelligence (AI) ⋅ GPU ⋅ Hardware ⋅ Software ⋅ Virtual Reality