Posted:
May 22, 2026
Location:
toronto, on, Canada

Job Description

Location: Downtown Toronto
Hybrid: 4 days in office

Ready to build what powers the next generation of AI?

We’re looking for a Staff LLMOps Engineer to lead the design, deployment, and optimization of large language model (LLM) infrastructure on the cloud.
You’ll be the driving force behind taking trained models from lab to production—scaling efficiently across multi-GPU clusters and pushing the boundaries of inference performance for enterprise-grade AI applications.

If you thrive at the intersection of AI, cloud engineering, and systems optimization , this is your chance to shape the future of large-scale model serving in a high-impact environment.

What You’ll Do

Architect and operationalize LLM deployment pipelines on AWS and Kubernetes/EKS.

Build and scale multi-GPU inference infrastructure for low latency, high availability, and cost efficie...

Apply for this Job

Submit your application for the Senior LLMOps Engineer -Cloud / AI Infrastructure position at TEEMA Solutions Group.

Apply Now Save for Later

Job Overview

Job Type: Full-time
Location: toronto, Canada
Posted: May 22, 2026
Deadline: July 01, 2026