Senior Performance Architect, Nemotron

NVIDIA
Full-time Santa Clara, CA other-general
Posted:
June 17, 2026
Location:
Santa Clara, CA, United States

Job Description

We are now looking for a Senior Performance Architect for Nemotron! At NVIDIA, we are redefining the future of AI systems through deep model–system–hardware co-design. We are looking for a forward-thinking Nemotron Performance Architect to shape the next generation of Nemotron models through performance modeling, analysis, and forward projections. In this role, you will predict before we build - developing high-fidelity models to evaluate how architectural choices translate into real-world deployment efficiency. You will ensure that future models achieve Pareto-optimal trade-offs across accuracy, throughput, and interactivity on target platforms.


Recent efforts such as LatentMoE (https://research.nvidia.com/labs/nemotron/LatentMoE/) architectures and the Nemotron Super (https://developer.nvidia.com/blog/introducing-nemotron-3-super-an-open-hybrid-mamba-transformer-moe-for-agentic-reasoning/) model exemplify the kind of performance-driven co-design you will help advance...

Apply for this Job

Submit your application for the Senior Performance Architect, Nemotron position at NVIDIA.

Apply Now Save for Later

Job Overview

Job Type: Full-time
Location: Santa Clara, United States
Posted: June 17, 2026
Deadline: June 23, 2026