Senior Site Reliability Engineer
Doghouse Recruitment
Job Description
Site Reliability Engineer – Bare Metal Linux – Data Center – Networking – €180k
Our client is building a cloud platform for high-throughput, compute-heavy workloads. They operate large-scale infrastructure where failure modes are real, capacity is finite, and reliability needs to be engineered, not handled.
We're seeking a Senior SRE who will own production reliability end-to-end for our client: define SLIs/SLOs, run error budget conversations, and ship changes that reduce incidents and improve latency (p95/p99). You'll build automation to kill toil, improve deployment safety (canary/rollback), and turn observability into signal rather than noise.
This is a bare-metal environment: think Linux, datacenters, physical fleets, and real hardware constraints, not managed services. You'll work close to the metal across Kubernetes internals (scheduling, autoscaling behavior, kubelet pressure/evictions, etcd/control plane), Linux...