This role is no longer accepting applications via Rocketlist.
Senior Site Reliability Engineer (SRE)
Finite StateSoftware Supply company
RemoteSenior
Software Engineering
About the role
Define and drive a modern observability and reliability strategy for an AI-first development organization.
- •Finite State partners with product security teams to create transparency for their connected devices and supply chains.
- •We are seeking a Senior Site Reliability Engineer (SRE) / Infrastructure Engineering leader to define, architect, and drive a modern observability and reliability strategy for an AI-first development organization.
- •Key Responsibilities Design modern telemetry pipelines and implement a comprehensive observability framework.
- •Establish and operationalize meaningful SLIs, SLOs, and SLAs aligned with business objectives.
- •Architect and implement scalable cloud infrastructure primarily within AWS, working closely with platforms like Vercel and Supabase.
- •Champion the use of AI tools to accelerate infrastructure provisioning, improve operational workflows, and enhance observability signal quality.
- •Define and evolve incident management processes, including on-call structures and postmortems.
- •Requirements Deep experience in reliability engineering, distributed systems, and production operations.
- •Forward-thinking mindset around AI-assisted development and infrastructure-as-code.
- •Experience with observability tooling including Honeycomb and Grafana.
Tech stack
AWSSupabaseGrafanaTerraformCloudFormationPulumiEC2S3RDSECSEKSGKECloud RunBigQueryRedshiftCI/CDDockerKubernetesPrometheusDatadogNew RelicPagerDutyNginxHAProxy
Match insights
Tech:AWS, Supabase, Grafana, Terraform, CloudFormation
Level:Senior