πŸ“ Jobs Near Me
πŸ“

HiringNearMe.work

Local Jobs, Zero Commute

πŸ“ Local Job Near You

Principal Software Engineer - AI Inference

🏒
NVIDIA
πŸ“ Santa Clara, United States
πŸ“
Location Santa Clara
πŸ“…
Posted June 01, 2026
πŸš—
Commute Local Area
🎯
Local Opportunity Near You!

This job is in your area. Enjoy a short commute and work close to home.

πŸ“‹
Job Description

NVIDIA is the platform for every new AI-powered application. We seek a Principal Software Engineer - AI Inference to advance open-source LLM serving. This role involves contributing to upstream inference engines like vLLM and SGLang. You will ensure they run outstandingly on NVIDIA GPUs and systems. You will also strengthen the underlying stack for high-throughput, low-latency inference at scale.


This is a hands-on, deeply technical role for someone who excels at the intersection of inference runtime architecture, GPU performance engineering, and distributed systems. You will collaborate closely with internal model teams, infrastructure/SRE, and product to ensure NVIDIA platforms are outstanding members in the broader inference ecosystem. You will also deliver production-grade improvements that benefit both NVIDIA and the community.


What you'll be doing:
+ Drive upstream-first engineering in vLLM/SGLang: author and land PRs or equivalent experience, eng...

Apply for This Job

Submit Application

Quick and secure application process

πŸ“ Location Details

πŸŒ†
City
Santa Clara
πŸ—ΊοΈ
Country
United States
πŸš—
Commute
Local Area

πŸ” More Jobs Nearby

Explore other opportunities in Santa Clara

View Local Jobs