Posted:
9/27/2024, 8:56:14 AM
Location(s):
Florida, United States ⋅ New York, United States ⋅ North Carolina, United States ⋅ New York, New York, United States ⋅ California, United States
Experience Level(s):
Senior
Field(s):
AI & Machine Learning
Workplace Type:
Remote
NVIDIA’s Retriever team is seeking a Senior Applied Research Scientist with experience researching, developing, and deploying deep learning models at scale across a range of modalities. You’ll join a team of Applied Research Scientists, Machine Learning and MLOps Engineers working on the next generation of retrieval pipelines for RAG, with a focus on modalities beyond text.
At NVIDIA we’re building the framework upon which production RAG systems are based. We have contributed to top research models in the text embedding space, topping the MTEB leaderboard and have developed commercially viable versions of these models for use in production systems by our customers. Come be a part of our world-class team building the future of Retrieval.
What you’ll be doing:
Working with our team of researchers to develop efficient and performant models and pipelines that extract text content from images, video, audio and other modalities.
Exploring and crafting datasets, metrics, experiments, and validation scripts to develop standard methodologies for research. These methodologies will offer customers clear guidance on which models and pipelines to apply in specific contexts.
Helping ML Engineers scale pipelines to production capability through the development of NVIDIA Inference Microservices (NIMs) and blueprints which demonstrate how to deploy NIMs in a pipeline effectively.
Writing papers, blog posts, documentation and trainings that help customers understand and take advantage of our research.
Keeping up to date with the latest developments in Retrieval across academia and industry.
What we need to see:
Candidates with a Master's, Ph.D. or equivalent experience in retrieval or multimodal research are preferred, along with a track record of publication in leading conferences like SIGIR, KDD, UMAP, RecSys, etc.
An understanding of the state of the art in retrieval research, with a focus on multimodal content retrieval.
3+ years of experience developing multimodal systems across a range of models and platforms. Information retrieval experience is a big plus.
Knowledge of best practices in batching, streaming, and scaling of ingestion pipelines to support real-world applications.
Excellent Python programming skills and a strong understanding of the Python deep learning ecosystem (PyTorch, Tensorflow, MXNet, etc).
An ability to share and communicate your ideas clearly through blog posts, papers, kernels, GitHub, etc.
Excellent communication and interpersonal skills are required, along with the ability to work in a dynamic, product-oriented, distributed team. A history of mentoring junior engineers and interns is a plus.
With a competitive salary package and benefits, NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. Are you a creative and autonomous Applied Scientist, who loves challenges? Do you have a genuine passion for advancing the state of GPU and CPU across a variety of industries? If so, we want to hear from you.
The base salary range is 148,000 USD - 276,000 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
Website: https://www.nvidia.com/
Headquarter Location: Santa Clara, California, United States
Employee Count: 10001+
Year Founded: 1993
IPO Status: Public
Last Funding Type: Grant
Industries: Artificial Intelligence (AI) ⋅ GPU ⋅ Hardware ⋅ Software ⋅ Virtual Reality