This job is in your area. Enjoy a short commute and work close to home.
Job Description
Senior / Staff Site Reliability Engineer | Β£136kβΒ£180k + equity | Remote Europe or London
We're partnering with a fast-growing developer infrastructure startup on a senior SRE hire at a pivotal moment in their growth.
The platform runs AI agents and background workflows in production at massive scale handling hundreds of millions of executions per month on infrastructure they run themselves. The team is ~13 people. No engineering managers. Engineers own large parts of the system and work directly with the founders.
The core challenge right now is scale. Execution volume is growing faster than the team can build, which means the next hires are walking into genuine distributed systems problems β not a greenfield rebuild or a dashboard feature.
What you'll be working on
- Owning observability across the platform OpenTelemetry, metrics, logs, traces, and making them genuinely useful at 3am
- Designing and o...