Location
Shanghai
Posted
June 27, 2026
Commute
Local Area
Local Opportunity Near You!
This job is in your area. Enjoy a short commute and work close to home.
Job Description
NVIDIA is developing processor and system architectures that accelerate deep learning on edge devices, workstations, and data center GPUs for a variety of applications including automotive, robotics, large language models and AI generative models. We are looking for an expert deep learning system performance architect to join our deep learning modelling, performance optimization, projections, and analysis effort. In this position, you will have the chance to optimize deep learning hardware and software architecture and make the significant impact in a dynamic technology focused company
What youβll be doing:
+ Benchmark and analyze performance of various machine learning/deep learning workloads across GPU- and NPU-based architectures
+ Build and validate performance models, and deliver performance projections and insights for deep learning (LLM/GenAI) workloads on emerging architectures
+ Identify architecture, software and system performance bottlenecks and prop...
What youβll be doing:
+ Benchmark and analyze performance of various machine learning/deep learning workloads across GPU- and NPU-based architectures
+ Build and validate performance models, and deliver performance projections and insights for deep learning (LLM/GenAI) workloads on emerging architectures
+ Identify architecture, software and system performance bottlenecks and prop...