π Local Job Near You
Senior High-Performance LLM Training Engineer
NVIDIA
π
Santa Clara, United States
Location
Santa Clara
Posted
May 27, 2026
Commute
Local Area
Local Opportunity Near You!
This job is in your area. Enjoy a short commute and work close to home.
Job Description
We are now looking for a Senior High-Performance LLM Training Engineer!
NVIDIA is seeking experienced engineers specializing in performance analysis and optimization to improve the efficiency of LLM training workloads, which are shaping the world's most advanced computing systems. This position focuses on optimizing NVIDIAβs high-performance LLM software stack in frameworks like PyTorch and JAX for high-performance training on thousands of GPUs, while also helping shape hardware roadmaps for the next generation of GPUs powering the AI revolution.
What you will be doing:
+ Understand, analyze, profile, and optimize AI training workloads on innovative hardware and software platforms.
+ Understand the big picture of training performance on GPUs, prioritizing and then solving problems across all state-of-the-art neural networks.
+ Implement production-quality software in multiple layers of NVIDIA's deep learning platform stack, from drivers to DL fram...
NVIDIA is seeking experienced engineers specializing in performance analysis and optimization to improve the efficiency of LLM training workloads, which are shaping the world's most advanced computing systems. This position focuses on optimizing NVIDIAβs high-performance LLM software stack in frameworks like PyTorch and JAX for high-performance training on thousands of GPUs, while also helping shape hardware roadmaps for the next generation of GPUs powering the AI revolution.
What you will be doing:
+ Understand, analyze, profile, and optimize AI training workloads on innovative hardware and software platforms.
+ Understand the big picture of training performance on GPUs, prioritizing and then solving problems across all state-of-the-art neural networks.
+ Implement production-quality software in multiple layers of NVIDIA's deep learning platform stack, from drivers to DL fram...