Senior Site Reliability Engineer (SRE)

Estuary

Estuary

Software Engineering

United States · Remote

Posted on Mar 11, 2026
Estuary is a real-time data integration platform combining CDC, stream processing, and declarative configuration into a unified system. The Role: Join as a foundational contributor to the infrastructure that powers Estuary, bridging backend engineering and operational excellence; architect systems ensuring our real-time data engine remains resilient, secure, and performant at enterprise scale. Be a key participant in incident response and a strategic driver in maturing detection, triage, and recovery workflows; build "self-healing" platforms that let developers move fast without breaking things. What You’ll Do: Architect for Resilience — design and manage multi-cloud infrastructure (AWS, GCP, Azure) for high availability and security; Evolve Incident Operations — on-call rotations, blameless postmortems, and system hardening; Build Automation & Tooling — internal tools and robust CI/CD; Master Infrastructure as Code — Pulumi and Kubernetes; Drive Observability — Prometheus, Grafana, OpenTelemetry; Collaborate & Mentor — establish operational best practices with backend and platform teams. What We’re Looking For: 8+ years of SRE/systems experience operating distributed systems at scale; Deep understanding of Linux internals, networking (gRPC, TCP/IP), and file systems; Proficiency in Go and scripting (Python/Bash); Kubernetes expertise managing stateful workloads; Infrastructure as Code mastery (Pulumi or similar); Process improvement mindset for operational maturity; Clear communication under pressure. Bonus Points: Rust experience; Data infrastructure background (CDC, Kafka, Flink); Multi-cloud mastery (AWS, GCP, Azure); Startup grit. Why Estuary: Competitive compensation, equity, full benefits; Flexible remote work; High-autonomy environment with direct product impact; Quarterly team offsites (Miami, Austin, Boulder, New Orleans).