Posted:
3/6/2024, 4:00:00 PM
Location(s):
Champaign, Illinois, United States ⋅ Illinois, United States ⋅ California, United States
Experience Level(s):
Expert or higher ⋅ Senior
Field(s):
AI & Machine Learning ⋅ Software Engineering
We are seeking expert System Software Engineers to join our Apache Spark Acceleration team. Data scientists spend a considerable amount of time exploring data and iterating over machine learning (ML) experiments. NVIDIA believes that data science and analytics workflows can benefit tremendously from being accelerated, to enable data users to explore more and larger datasets to drive towards their business goals faster and more optimally.
You will work with the open source community to accelerate Apache Spark with GPUs for data science. Apache Spark is the most popular data processing engine in data centers. We strive to significantly accelerate Apache Spark 3.x use cases without application code changes. You will work on open source libraries (such as https://nvidia.github.io/spark-rapids/) to be used in both on-premises and cloud services (such as Databricks, AWS EMR, Google Dataproc, and Cloudera).
What you'll be doing:
Leading the design and implementation of accelerated Apache Spark and related big-data frameworks
Creating a collection of accelerated libraries for data analytics and machine learning
Working with a team of outstanding engineers including PMC and Committers of Apache Spark, Apache Hadoop, Apache Hive, and Apache Arrow
Engaging open source communities (including Apache Spark, RAPIDS and UCX) for technical discussion and contribution
Working with NVIDIA strategic partners on deploying advanced machine learning and data analytics solutions in public cloud or on-premise clusters
Presenting technical solutions in industry conferences and meetups
Provide recommendations and feedback to teams regarding decisions surrounding topics such as infrastructure, continuous integration and testing strategy
Build, test and optimize CUDA/C++ libraries across different platforms
What we need to see:
BS, MS, or PhD in Computer Science, Computer Engineering, or closely related field or equivalent experience
15+ years of work experience in software development
5+ years working experience with key open source big-data projects as a contributor or committer including Apache Spark, Apache Flink, Trino, Apache Kafka, Apache Hive, Apache Arrow, Apache Hadoop, Delta Lake, Apache Iceberg
Outstanding technical skills in designing and implementing high-quality distributed systems
Excellent programming skills in C++, Java, and/or Scala
Ability to work successfully with multi-functional teams across organizational boundaries and geographies
Highly motivated with strong interpersonal skills
You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
Website: https://www.nvidia.com/
Headquarter Location: Santa Clara, California, United States
Employee Count: 10001+
Year Founded: 1993
IPO Status: Public
Last Funding Type: Grant
Industries: Artificial Intelligence (AI) ⋅ GPU ⋅ Hardware ⋅ Software ⋅ Virtual Reality