π Local Job Near You
Reinforcement Learning Systems
Microsoft Corporation
π
Multiple Locations, United States
Location
Multiple Locations
Posted
June 10, 2026
Commute
Local Area
Local Opportunity Near You!
This job is in your area. Enjoy a short commute and work close to home.
Job Description
**Overview**
Microsoft AI is looking for a Member of Technical Staff β Reinforcement Learning Systems to help build the worldβs most advanced reinforcement learning systems. We are responsible for designing, developing, and operating the large-scale reinforcement learning systems that power several use cases across the Superintelligence team β from training trustworthy and capable agents and powerful reasoning models to helpful and conversational assistants.
We are looking for individuals who can contribute to cutting-edge research and help bridge the gap between cutting-edge research and robust, production-grade distributed systems. The ideal candidate has both distributed systems expertise and a scientific mindset and will be able to build complex and scalable systems from the ground up, identify and resolve performance bottlenecks, debug complex, cross-system issues with extremely high attention to detail, and contribute to solving scientific and research challenge...
Microsoft AI is looking for a Member of Technical Staff β Reinforcement Learning Systems to help build the worldβs most advanced reinforcement learning systems. We are responsible for designing, developing, and operating the large-scale reinforcement learning systems that power several use cases across the Superintelligence team β from training trustworthy and capable agents and powerful reasoning models to helpful and conversational assistants.
We are looking for individuals who can contribute to cutting-edge research and help bridge the gap between cutting-edge research and robust, production-grade distributed systems. The ideal candidate has both distributed systems expertise and a scientific mindset and will be able to build complex and scalable systems from the ground up, identify and resolve performance bottlenecks, debug complex, cross-system issues with extremely high attention to detail, and contribute to solving scientific and research challenge...