NV
Posted:
June 17, 2026
Location:
Shanghai, China, China
Job Description
NVIDIA is hiring software engineers for its AI Computing team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning-powered AI, enabling breakthroughs in areas like LLM, ChatGPT and GenerativeAI that has put DL at the “iPhone moment” for AI. Join the team which is building the inferencing software which will be used across our product lines! The ability to work on a fast-paced delivery-focused team is required and excellent interpersonal skills are a must.
What you'll be doing:
+ Craft and develop robust inferencing software that can be scaled to multiple platforms for functionality and performance
+ Performance analysis, optimization and tuning
+ Closely follow academic developments in the field of artificial intelligence and feature update TensorRT-LLM
+ Provide feedback into the architecture and hardware design and development
+ Collaborate across the company to guide the direction of machine ...
What you'll be doing:
+ Craft and develop robust inferencing software that can be scaled to multiple platforms for functionality and performance
+ Performance analysis, optimization and tuning
+ Closely follow academic developments in the field of artificial intelligence and feature update TensorRT-LLM
+ Provide feedback into the architecture and hardware design and development
+ Collaborate across the company to guide the direction of machine ...
Apply for this Job
Submit your application for the AI Computing Development Engineer, TensorRT-LLM position at NVIDIA.
Apply Now Save for LaterJob Overview
Job Type:
Full-time
Location:
Shanghai, China
Posted:
June 17, 2026
Deadline:
June 23, 2026