Senior Site Reliability Engineer
ReplitAI-powered Development company
RemoteSenior
Software Engineering
About the role
Ensure the reliability, scalability, and performance of Replit's infrastructure.
- •Join our Site Reliability Engineering team and help ensure the reliability, scalability, and performance of Replit's infrastructure that serves millions of developers worldwide.
- •Key Responsibilities Design and Implement Observability Solutions Drive Automation and Infrastructure as Code Establish SLOs and SLIs Incident Management and Response Performance Optimization Requirements 4-8 years of experience in Site Reliability Engineering or similar roles Strong programming skills in Python, Go, or similar Deep understanding of distributed systems Experience with container orchestration platforms (Kubernetes) and cloud-native technologies Proven track record of implementing and maintaining monitoring/observability solutions Strong incident management skills with experience leading incident response Experience with infrastructure as code and configuration management tools
Tech stack
PythonGoKubernetesCI/CD
Match insights
Tech:Python, Go, Kubernetes, CI/CD
Level:Senior