Senior Software Engineer, AI Inference

NVIDIA Gruppe
Full-time toronto, on IT & Technology
Posted:
June 04, 2026
Location:
toronto, on, Canada

Job Description

Help us push the boundaries of AI inference at NVIDIA — where your systems expertise shapes both the technology and the teams building on top of it! We're looking for a Senior Software Engineer to work at the frontier of large-scale LLM serving, partnering directly with some of the world's most technically demanding customers to unlock the full performance potential of NVIDIA's inference stack. In this role, you'll combine deep systems knowledge with hands‑on customer engagement — profiling real deployments, benchmarking across GPU clusters, and turning insights into improvements that ripple across the open-source ecosystem. Do you love digging into performance problems that don't have obvious answers, and want your work to have an impact far beyond a single codebase? We'd love to talk. Unlike traditional customer‑facing engineering roles, we expect you to go far deeper — contributing to vLLM, NVIDIA Dynamo, and the tooling that makes every engineer on your team more effective. <...

Apply for this Job

Submit your application for the Senior Software Engineer, AI Inference position at NVIDIA Gruppe.

Apply Now Save for Later

Job Overview

Job Type: Full-time
Location: toronto, Canada
Posted: June 04, 2026
Deadline: July 14, 2026