Sr. SDM, AI Inference Technology, Neuron SDK
Amazon
Seattle, United States
Location
Seattle
Posted
June 01, 2026
Job Description
AWS Utility Computing (UC) provides product innovations, from foundational services such as Amazon Elastic Compute Cloud (EC2) to new product innovations that continue to set AWS's services and features apart in the industry.
Come develop inference acceleration for AWS Neuron, the complete software stack for Trainium, Amazon's custom cloud-scale machine learning accelerators that power the latest AI models.
As the Sr. SDM for the Inference Technology Team, you will lead a strong team of managers and engineers building the fundamental inference technology building blocks and libraries that enable AI developers to optimize models for inference on Trainium and Inferentia devices. You will be responsible for the full development life cycle of inference library and feature development, including reliability and scalability. You will develop the Neuronx_Distributed Inference Libraries and contribute to other popular open-source inference libraries, enabling custo...