Posted:
7/16/2024, 5:00:00 PM
Location(s):
Taipei, Taiwan
Experience Level(s):
Senior
Field(s):
Software Engineering
NVIDIA is the world leader in GPU Computing. We are passionate about markets include gaming, automotive, vision, HPC, datacenters and networking in addition to our traditional OEM business. NVIDIA is also well positioned as the ‘AI Computing Company’, and NVIDIA GPUs are the brains powering Deep Learning software frameworks, analytics, data centers, and driving autonomous vehicles. We have some of the most experienced and dedicated people in the world working for us. If you are dedicated, forward-thinking, and hard-working technical people across countries sounds exciting, this job is for you.
NVIDIA is looking for an outstanding individual who thrives in a diverse work environment, has outstanding interpersonal skills and possesses a strong sense of engagement and continuous process improvement. This candidate must have enterprise system integration, strong OS/Virtualization experience, reliability testing with various telemetries, scale out cluster, test plan and automation development experience to join our platform SWQA team.
What you’ll be doing:
Responsible for the development and execution of NVIDIA MGX/HGX/DGX platform test plan on OS, FW and CUDA SW stack from design doc. Installing and testing various systems OS, system firmware and SW stack.
Drive support for root cause analysis on reliability and validation test failures to identify root cause(s) and achieve mitigation.
Build, develop/debug automation front-end and back-end framework and tests
Review partner and supplier test results and prescribe additional reliability testing on components, systems, and packaging as needed.
Work in an agile software development team with very high production quality standards, and Manage bug lifecycle and collaborate with inter-groups to drive for solutions.
What we need to see:
Bachelor’s Degree (or equivalent experience) in a STEM (Science, Technology, Engineering, Math or Physics) field with 5+ years proven experience; or Master’s Degree with 3 years of meaningful work experience
Proven years of automation experience using Python, Ansible, Jenkins, C/C++
Strong OS(Ubuntu, RedHat, CentOS, SuSE, Fedora, Windows and etc…) trouble-shooting and debugging experience in a bare-metal and KVM/VMWare environment.
Ability to write test plans focusing on functional, performance, stress and negative testing.
Proven years of experience in GitHub/Gitlab/Gerrit, PXE, SLURM, Stack/Kubernetes/Docker) – huge plus
Ways to stand out from the crowd:
Experience working with NVIDIA GPU hardware is a strong plus.
Good to have solid understanding of virtualization in Linux (KVM, Docker orchestrated with Kubernetes)
Expertise in packaging software in Linux (rpm, debs)
Background in parallel programming ideally CUDA/OpenCL is a plus
Experience of developing x86/ARM based environment.
With competitive salaries and a generous benefits package, NVIDIA is widely considered to be one of the most desirable employers in the world. We have some of the most brilliant and talented people in the world working for us. If you are creative, autonomous and love a challenge, we want to hear from you. We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.
Website: https://www.nvidia.com/
Headquarter Location: Santa Clara, California, United States
Employee Count: 10001+
Year Founded: 1993
IPO Status: Public
Last Funding Type: Grant
Industries: Artificial Intelligence (AI) ⋅ GPU ⋅ Hardware ⋅ Software ⋅ Virtual Reality