AI Systems Reliability Engineer Position

Tenstorrent
Full-time toronto, on Engineering
Posted:
May 23, 2026
Location:
toronto, on, Canada

Job Description

Become a Site Reliability Engineer to support cutting-edge AI technologies. Ensure system reliability and operational effectiveness utilizing your Linux and automation skills in a hybrid setup.

In this role, you will focus on the intersection of reliability and customer engineering, validating that our AI systems are production-ready. Engaging with internal teams, you will tackle complex issues and enhance monitoring and automation processes, contributing significantly to system performance and reliability.

Key Responsibilities:
• Maintain operational integrity of AI infrastructures
• Troubleshoot issues spanning compute, network, and software
• Collaborate with teams for incident response
• Enhance monitoring and observability frameworks
• Create automation solutions to boost reliability

Requirements:
• Expertise in site reliability or systems engineering
• Advanced Linux troubleshooting capabilities
...

Apply for this Job

Submit your application for the AI Systems Reliability Engineer Position position at Tenstorrent.

Apply Now Save for Later

Job Overview

Job Type: Full-time
Location: toronto, Canada
Posted: May 23, 2026
Deadline: July 02, 2026