AI Computing Development Engineer, TensorRT-LLM

NVIDIA

Full-time Shanghai, China other-general

Posted:

June 17, 2026

Location:

Shanghai, China, China

Job Description

                    NVIDIA is hiring software engineers for its AI Computing team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning-powered AI, enabling breakthroughs in areas like LLM, ChatGPT and GenerativeAI that has put DL at the “iPhone moment” for AI. Join the team which is building the inferencing software which will be used across our product lines! The ability to work on a fast-paced delivery-focused team is required and excellent interpersonal skills are a must.
  
What you'll be doing:
+ Craft and develop robust inferencing software that can be scaled to multiple platforms for functionality and performance
+ Performance analysis, optimization and tuning
+ Closely follow academic developments in the field of artificial intelligence and feature update TensorRT-LLM
+ Provide feedback into the architecture and hardware design and development
+ Collaborate across the company to guide the direction of machine ...

Apply for this Job

Submit your application for the AI Computing Development Engineer, TensorRT-LLM position at NVIDIA.

Apply Now Save for Later

Job Overview

Job Type: Full-time

Location: Shanghai, China

Posted: June 17, 2026

Deadline: June 23, 2026