Software Engineer, LLM MLOps

Posted:
9/10/2024, 11:11:18 AM

Location(s):
California, United States

Experience Level(s):
Senior

Field(s):
DevOps & Infrastructure ⋅ Software Engineering

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s an outstanding legacy of innovation that’s motivated by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world!

As a Senior Software Engineer on the LLM MLOps team at NVIDIA, you will have the unique opportunity to contribute to the development of future technologies in the GenAI space. The team is responsible for building platforms to produce high quality human annotated data to train the NextGen LLMs at NVIDIA, build tools and workflows for model fine-tuning and model evaluation.

What you'll be doing:

  • Collaborating with the team to integrate with human annotation tooling, cloud infrastructure, backend services, visualization dashboard, telemetry system, CI/CD process, and more.

  • Building platforms, web services, and tools to produce high-quality human annotated data for training the next generation of LLMs at Nvidia.

  • Working in a fast-paced and highly experimental environment to solve complex problems and implement innovative solutions.

  • Designing and maintaining large-scale backend clusters in cloud environments.

  • Collaborating with diverse teams to ensure flawless execution and successful implementation of projects.

  • System design and architecture discussions to determine the best approaches and solutions.

  • Mentoring and guiding junior engineers to help them grow and develop their skills.

  • You will be part of an exceptional team of engineers and researchers, working together to explore the frontiers of AI . Your expertise will play a crucial role in driving the progress of innovative technologies.

What we need to see:

  • 8+ years of software development and system design experience in cloud environments.

  • BS or MS or equivalent experience

  • Strong programming skills in languages like Python, C++, or Java.

  • Solid computer science fundamentals in data structures, algorithms and object-oriented design.

  • Experience in transforming product requirements into software architecture and ability to design technically sophisticated distributed systems and RESTful services.

  • Solid understanding of software development lifecycles, testing methodologies, and QA automation processes including software engineering standard methodologies (e.g., version control, code reviews, testing, and CI/CD).

  • Experience with monitoring production environments: metrics, alarming, dashboards, integration tests, canaries, log aggregation, various CI/CD processes and tools, etc.

  • Strong communication and collaboration abilities with ML model researchers.

  • Excellent problem-solving and analytical skills.

  • Experience with cloud-based ML platforms (e.g., AWS SageMaker, Azure ML, Google AI Platform).

  • Background with Dockers and Kubernetes and familiarity with ONNX and TensorRT.

Ways To Stand Out From The Crowd:

  • Experience in human annotation tools for GenAI annotation tasks.

  • Proven track record of published research or contributions to open-source ML libraries and tools.

  • Passion for AI and a demonstrated commitment to advancing the field through innovative research and development.

  • Application of instruction tuning, reinforcement learning from human feedback (RLHF), and parameter efficient fine tuning such as p-tuning, adaptors, LoRA, and so on to improve LLMs for different use cases.

If you are passionate about solving sophisticated problems, thrive in a collaborative and ambitious environment, and want to be a part of a world-class team, then this is the perfect opportunity for you. Join us at NVIDIA and make a lasting impact on the world of AI and computing. Widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer to you and your family www.nvidiabenefits.com.

The base salary range is 180,000 USD - 339,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

NVIDIA

Website: https://www.nvidia.com/

Headquarter Location: Santa Clara, California, United States

Employee Count: 10001+

Year Founded: 1993

IPO Status: Public

Last Funding Type: Grant

Industries: Artificial Intelligence (AI) ⋅ GPU ⋅ Hardware ⋅ Software ⋅ Virtual Reality