Job Description
We are looking for a Senior Site Reliability Engineer (SRE) to help design, scale, and secure our rapidly growing platform infrastructure.
You will work across all critical systems — from customer-facing applications and APIs to internal platforms and data services — ensuring availability, performance, and cost efficiency at scale.
You’ll be hands-on with Kubernetes, observability, GitOps, automation, and cloud infrastructure, while partnering closely with application, platform, and data teams to deliver a highly reliable and self-healing environment.
This role is ideal for an engineer who thrives on complex distributed systems, loves to automate everything, and can balance speed, stability, and cost-efficiency in production.
- Bachelor’s degree in Computer Science, Engineering, or a related field — or equivalent work experience.
- Design, deploy, monitor, and maintain production workloads across Kubernetes (EK...
Apply for this Job
Submit your application for the Data & ML Ops position at Salla.
Apply Now Save for Later