An
Senior Site Reliability Engineer / HPC - Pre-IPO Tech Leader
Andiamo
Full-time
toronto, on
Other-General
Posted:
May 31, 2026
Location:
toronto, on, Canada
Job Description
Senior Site Reliability Engineer / HPC – Pre‑IPO Tech Leader About The Role We are seeking a highly skilled
Senior Site Reliability Engineer (SRE) / High-Performance Computing (HPC) Engineer
to design, build, and operate the large-scale infrastructure that powers a $2.5B pre‑IPO technology company. Our systems run on massive distributed clusters, handling some of the most demanding workloads in cloud, AI, and data‑driven computing. In this role, you will be responsible for ensuring the reliability, scalability, and performance of mission‑critical platforms. You will optimize HPC workloads, streamline CI/CD for large‑scale clusters, and enable research and product teams to deliver innovations with speed and confidence. This is a hands‑on position with the opportunity to influence architecture, lead reliability initiatives, and solve some of the hardest problems in distributed systems and performance engineering.
What You’ll Do
Design Reliable Infrastr...
Senior Site Reliability Engineer (SRE) / High-Performance Computing (HPC) Engineer
to design, build, and operate the large-scale infrastructure that powers a $2.5B pre‑IPO technology company. Our systems run on massive distributed clusters, handling some of the most demanding workloads in cloud, AI, and data‑driven computing. In this role, you will be responsible for ensuring the reliability, scalability, and performance of mission‑critical platforms. You will optimize HPC workloads, streamline CI/CD for large‑scale clusters, and enable research and product teams to deliver innovations with speed and confidence. This is a hands‑on position with the opportunity to influence architecture, lead reliability initiatives, and solve some of the hardest problems in distributed systems and performance engineering.
What You’ll Do
Design Reliable Infrastr...
Apply for this Job
Submit your application for the Senior Site Reliability Engineer / HPC - Pre-IPO Tech Leader position at Andiamo.
Apply Now Save for LaterJob Overview
Job Type:
Full-time
Location:
toronto, Canada
Posted:
May 31, 2026
Deadline:
July 10, 2026