Location
Sunnyvale
Posted
June 04, 2026
Commute
Local Area
Job Description
**Position Summary...**
Walmart processes more transactions in a day than most companies handle in a year. When performance degrades or systems fail, the impact is immediate, measured in millions of dollars and hundreds of millions of customers. We're building the team that prevents that using agentic AI.
As a Principal Engineer in Performance and Resiliency Engineering, you'll architect and lead the development of intelligent, self-healing systems: LLM-based agents that detect anomalies, reason across observability data, and trigger automated remediation, without waiting for a human in the loop. You'll operate at a scale most AI engineers never encounter: 10,500 stores, 240M weekly customers, and infrastructure that powers one of the world's largest retail ecosystems.
This isn't a research role or a proof-of-concept environment. You'll own the technical strategy, set architectural direction, and ship to production, building agentic systems that directly impact ...