📍 Local Job Near You

Engineering Manager, LLM Performance

🏢

NVIDIA

📍 Santa Clara, United States

📍

Location Santa Clara

📅

Posted June 24, 2026

🚗

Commute Local Area

🎯

Local Opportunity Near You!

This job is in your area. Enjoy a short commute and work close to home.

📋
Job Description

                    At NVIDIA, we   aren't   just powering the AI revolution— we're   accelerating it.   We are accelerating LLM inference across the stack   and across all   open source   LLM frameworks like TensorRT LLM,   vLLM   and   SGLang .   With demand for AI exploding, particularly in the realm of large language models (LLMs) and vision language models (VLMs, VLAs), we are significantly expanding our team.   
  
 We're   seeking   a highly skilled and driven Engineering Manager to take the lead in   accelerating   the next generation of LLM/VLM/VLA inference software technologies that will define the future of AI. This is a high-impact, hands-on leadership role at the intersection of deep technical   expertise   and world-class management. You   won't   just manage;   you'll   architect and guide a brilliant team of engineers who are   pushing the performance of   LLM inference. Your work will be highly collaborative, interfacing directly with NVIDIA Researchers, GPU Architects, and o...

Apply for This Job

Submit Application

Quick and secure application process

📍 Location Details

🌆

City

Santa Clara

🗺️

Country

United States

🚗

Commute

Local Area

🔍 More Jobs Nearby

Explore other opportunities in Santa Clara

View Local Jobs

Engineering Manager, LLM Performance

📋 Job Description

Apply for This Job

📍 Location Details

🔍 More Jobs Nearby

📋
Job Description