Posted:
7/3/2024, 5:00:00 PM
Location(s):
California, United States
Experience Level(s):
Expert or higher ⋅ Senior
Field(s):
Software Engineering
Workplace Type:
Remote
NVIDIA data center systems, such as DGX and HGX, have become core to NVIDIA's rapidly growing enterprise and cloud provider businesses. These platforms bring together the full power of NVIDIA GPUs, NVIDIA NVLink, NVIDIA InfiniBand networking, NVIDIA Grace CPUs, and a fully optimized NVIDIA AI and HPC software stack. We are hiring Software Architect who will help build & own system architecture of simulators for our DGX and HGX Server platforms. Simulations play a significant role in building scalable systems at Speed of Light! You will work with world class engineering teams across HW and SW and engage with industry leading cloud service providers on taking these products to market.
What you’ll be doing:
Drive requirements, architecture, and roadmap of NVIDIA DGX Simulation platforms.
Engage with major customers to understand their requirements and work to align with their roadmap and adoption strategy.
Work closely with hardware modeling, kernel & platform driver teams distributed globally.
Build and deliver full server simulation platform to internal and external NVIDIA partners.
Mentor architects and engineering teams to grow them into future leaders.
Make key technical decisions even when faced with ambiguity and mitigate execution risks by following left shift strategy.
What we need to see:
BS degree or higher (Computer science or related) or equivalent experience
10+ years of relevant experience in virtualization and HW simulation/emulation technologies.
Proven experience in designing architecture for scalable and performant server systems, particularly at the SW/HW interface.
Previous experience around hardware interfaces such as PCIe, SPI, I3C etc with Linux boot solutions on x86 & ARM class platforms.
Good understanding of hypervisors & HW emulators, like Qemu, KVM, VDK, SIMICs etc.
Experience in Out of Band and Inband management architectures.
Proficient in C / C++ with strong software development, optimization, user & kernel mode debugging skills.
Strong interpersonal & communication skills to work with a globally distributed engineering team.
Ways to stand out from the crowd:
Experience in building left shift strategy around HW & SW stack bringup using Simulators & Emulators etc.
Contribution in Qemu/KVM opensource repositories.
Experience in Verilog and SystemC
Knowledge of device management protocols such as MCTP, PLDM and RDE.
Knowledge of system management protocols such as Redfish and IPMI.
You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
Website: https://www.nvidia.com/
Headquarter Location: Santa Clara, California, United States
Employee Count: 10001+
Year Founded: 1993
IPO Status: Public
Last Funding Type: Grant
Industries: Artificial Intelligence (AI) ⋅ GPU ⋅ Hardware ⋅ Software ⋅ Virtual Reality