Senior Technical Program Manager, Network Monitoring
Amazon
Dublin, Ireland
Location
Dublin
Posted
June 06, 2026
Job Description
AWS operates one of the world's largest and most highly available networks, which continues to grow rapidly in both size and complexity in response to customer demand. Many AWS customers run mission-critical workloads that depend on our networks to be always on. Network Monitoring and Remediation (NMR) is responsible for preventing, predicting, detecting and remediating impairments across our network, both physical and logical, before they impact our customers. We own the software systems that consume trillions of telemetry datapoints per day from millions of network devices, use various big-data techniques including ML/AI to turn the telemetry into actionable notifications when things go wrong, and package them into our event management system for automated mitigation and remediation. For the less than 2% of detected issues that we haven't yet automated, we provide a suite of expert diagnostic tools that enable our human operators to quickly determine next ste...