Senior Site Reliability Engineer (SRE) Multi-Cloud
This job is in your area. Enjoy a short commute and work close to home.
Job Description
100% remote
-
Lead the design, deployment, and maintenance of highly available, multi-site infrastructure across GCP (preferred) and AWS/Azure environments.
-
Manage and optimize Kubernetes clusters, implement automated deployments, and ensure robust observability and monitoring across cloud services.
-
Build, maintain, and enhance infrastructure as code using Terraform, along with scripting and automation in Python, Go, Node.js, or similar languages.
-
Troubleshoot and resolve complex system incidents, ensuring reliability, scalability, and security of critical applications.
-
Collaborate with engineering, DevOps, and security teams to establish best practices for CI/CD pipelines, multi-cloud architecture, and operational workflows.
-
<...