Senior Software Engineer, Machine Learning Inference
NVIDIA
Santa Clara, United States
Location
Santa Clara
Posted
May 30, 2026
Job Description
At NVIDIA, we're at the forefront of innovation, driving advancements in AI and machine learning to solve some of the world's most challenging problems. We're seeking talented and motivated engineers to join our TensorRT team in developing the industry-leading deep learning inference software for NVIDIA AI accelerators.
As a Senior Software Engineer in the TensorRT team, you will be responsible for designing and implementing inference software optimizations to power AI applications on NVIDIA GPUs. If you're ready to take on challenging projects and make a significant impact in a company that values creativity, excellence, and collaboration, we want to hear from you!
What you'll be doing:
+ Design, develop and optimize NVIDIA TensorRT and TensorRT-LLM to supercharge inference applications for datacenter, workstations, and PCs.
+ Develop software in C++, Python, and CUDA for seamless and efficient deployment of state-of-the-art LLMs and Generative A...