Solutions Architect, Data Center Infrastructure

Posted:
7/28/2024, 5:00:00 PM

Location(s):
Texas, United States ⋅ California, United States ⋅ Virginia, United States

Experience Level(s):
Senior

Field(s):
DevOps & Infrastructure

Workplace Type:
Remote

NVIDIA is seeking a Solutions Architect in Data Center Infrastructure to join our Infrastructure Specialists team. Academic and commercial groups worldwide are using NVIDIA products to redefine deep learning and data analytics, and to power data centers. Join the team building many of the world's largest and fastest data centers and supercomputers! NVIDIA is looking for someone who can lead the planning and deployment of AI data centers, including power and cooling systems, cabling, network provisioning, and bring-up and validation.

As the NVIS Solutions Architect for Datacenter Infrastructure, you will focus on data center audit, planning, and deployment, ensuring the integrity of NVIDIA platform infrastructure. Your primary goal will be to guarantee that all aspects of the data center's physical infrastructure are meticulously planned, implemented, and validated to meet NVIDIA reference architectures, operational requirements, and industry standards. This infrastructure includes architectural systems, power distribution, liquid/air cooling systems, compute, network and cabling (fiber and copper), telemetry, and all other physical infrastructure.

What you will be doing:

  • NVIS Datacenter Engineering and Planning: Collaborate with other teams to plan and implement data center infrastructure solutions based on NVIDIA Datacenter reference architecture, including power distribution, cooling systems, network architecture, server hardware, and storage systems.

  • Operations Audit Planning: Develop and implement comprehensive audit plans to assess data center infrastructure components' operational efficiency, reliability, and readiness. Conduct pre-deployment audits to identify potential issues, risks, and areas for improvement.

  • Infrastructure Design Oversight: Review and evaluate customers' and partners' infrastructure design proposals, ensuring consistency with best practices, industry standards, and regulatory requirements. Provide feedback and recommendations to improve performance, scalability, and cost-effectiveness.

  • Mentorship: Act as the NVIS mentor, providing guidance and support to ensure the NVIS team's success in their respective roles.

  • Quality Assurance: Establish and enforce quality assurance processes to verify that deployments meet established specifications and performance benchmarks. Conduct thorough bring-up, testing, and validation to confirm the functionality and reliability of infrastructure components.

  • Continuous Improvement: Drive continuous improvement initiatives to enhance the efficiency, resilience, and sustainability of data center infrastructure for the NVIDIA data center reference architecture and deployment blueprint. Find opportunities to streamline processes, automate repetitive tasks, and leverage emerging technologies to optimize infrastructure operations.

  • Collaboration and Communication: Collaborate and communicate across internal teams, external vendors, and customers to facilitate the seamless integration of data center infrastructure solutions. Serve as a domain expert and point of contact for infrastructure-related inquiries and escalations.

What we need to see:

  • Bachelor's degree (or equivalent experience) in Engineering, Computer Science, Information Technology, or a related field. Advanced degree or relevant certifications preferred.

  • 10+ years of overall experience in enterprise and/or hyperscale data centers with continual infrastructure and service improvement, preferably for high density AI/HPC data centers.

  • Proven experience in data center engineering, operations, or infrastructure management roles, focusing on large-scale data center deployments.

  • Strong technical knowledge of and experience with the data center stack: power distribution, liquid cooling, servers, networking, storage, and pre-deployment planning.

  • Relevant certification – preferred

  • Demonstrated technical and project leadership in fluid situations, with the ability to adapt to unknowns and change.

  • Excellent analytical, problem-solving, and decision-making skills, keen attention to detail, and a commitment to quality.

  • Effective communication and interpersonal skills, with the ability to interact professionally with diverse collaborators including customers and facilitate productive discussions.

  • Organization & Time Management – able to plan, schedule, and organize tasks related to the job to achieve goals within or ahead of established time frames.

  • Willingness to travel (50%).

Ways to stand out from the crowd:

  • Experience with data center operations processes and with safety and security measures.

  • Strong knowledge of the whole data center infrastructure stack.

  • Outstanding social skills.

NVIDIA is widely considered one of the world's most desirable employers in technology. We have some of the world's most forward-thinking and passionate people working for us. If you're creative and autonomous, we want to hear from you.

The base salary range is 148,000 USD - 276,000 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.