Senior AI and HPC Modelling Architect

Posted:
12/22/2024, 5:42:15 AM

Location(s):
Tel Aviv-Yafo, Tel Aviv District, Israel ⋅ Tel Aviv District, Israel

Experience Level(s):
Senior

Field(s):
AI & Machine Learning ⋅ Software Engineering

Our technology has no boundaries! NVIDIA is building the world’s most groundbreaking and state of the art accelerated compute platforms for the world to use. It’s because of our work that scientists, researchers and engineers can advance their ideas. We pioneered a supercharged form of computing loved by the fastest paced computer users in the world - scientists, designers, artists, and gamers.

We are looking for a motivated network performance modeling architect to develop a network simulator with the purpose of analyzing and improving the performance of AI and High-Performance Computing workloads. You will solve complex problems and develop innovative software and hardware solutions using the network simulator.

What you’ll be doing:

  • Develop models for simulations, analyze simulation results, and develop optimization algorithms.

  • Analyze and model the communication patterns of key ML and AI applications.

  • Develop simulation components to evaluate and analyze micro-architecture and architectural networking solutions.

  • Executing workloads on AI systems, conducting profiling, and analyzing bottlenecks and possible enhancements.

  • Collaborate with multi-functional teams, including other architecture teams, VLSI logic design, system software, firmware, and research teams, to ensure our architecture is integrated into NVIDIA's products.

  • Spearheading the conceptualization of next-generation networking products tailored to support and accelerate state-of-the-art ML workloads.

What we need to see:

  • M.Sc. or Ph. D degree in Computer Science, Computer Engineering or Electrical Engineering.

  • Experience in developing simulation models.

  • At least 5 years of industry or research experience in computer networks, specifically in ML/AI workloads.

  • Great problem-solving and critical-thinking skills.

  • Ability to cooperate with multiple groups in the organization.

  • Ability to lead activities and push innovation.

Ways to stand out of the crowd:

  • Coding experience in C++, Python and/or NCCL.

  • Knowledge/hands-on experience in network protocols - such as InfiniBand and RoCE.

  • Knowledge/experience in network topologies, LLM or DL (research or design).

  • Architectural experience in SW-HW co-design.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.