Site Reliability Engineer

RCS TECH
Full-time | Mexico City, Mexico City | Other-General
Posted: June 02, 2026
Location: Mexico City, Mexico City, Mexico

Job Description

What You'll Do

Reliability & Operations
- Own availability, latency, and scalability across SaaS and AI systems
- Define and enforce SLOs, SLIs, and error budgets
- Participate in a global on-call rotation (~1 week every 4 weeks)
- Lead incident response and drive blameless postmortems with systemic fixes

Platform & Infrastructure
- Architect and operate on-premise and multi-region, multi-cloud environments
- Manage large-scale Kubernetes workloads
- Build and evolve infrastructure using Terraform and Ansible
- Improve system resilience, fault isolation, and capacity planning

AI/ML & Automation
- Build and scale agentic AI systems for triage, anomaly detection, and self-healing
- Ensure reliability of model serving infrastructure
- Operate, optimize, and scale distributed systems

What You Bring
- 5+ years in SRE, Production Engineering, or Platform Engineering
- Strong experience with cloud providers (AWS/GCP/OCI), Kubernetes, and IaC (Terraform/Ansible)
- Proficiency in Python, Go, or ...

Job Overview

Job Type: Full-time
Location: Mexico City, Mexico
Posted: June 02, 2026
Deadline: July 12, 2026