Staff Machine Learning Performance Engineer (Inference Optimisation)

Wayve

Full-time london, england Other-General

Posted:

June 11, 2026

Location:

london, england, United-Kingdom

Job Description

Requirements Proven experience improving performance in production systems with tight constraints (latency, memory, bandwidth, power/thermal, or cost) 
Strong proficiency with at least one relevant stack/toolchain (e.g. TensorRT, CUDA, Qualcomm QNN, Triton, OpenCL) and confidence learning adjacent frameworks quickly 
Comfort operating at multiple levels of abstraction — from high‑level model behaviour down to low‑level kernel/runtime execution 
Strong software engineering fundamentals (debugging, profiling, testing, and maintainable code) 
Clear communicator and collaborative teammate; able to align multiple stakeholders on performance trade‑offs and priorities 
(Desirable) Exposure to embedded or edge deployment of ML models, including benchmarking on real devices and handling system‑level constraints 
(Desirable) Experience with NVIDIA and/or Qualcomm SoCs and performance tooling 
(Desirable) Python ...
                

Apply for this Job

Submit your application for the Staff Machine Learning Performance Engineer (Inference Optimisation) position at Wayve.

Apply Now Save for Later

Job Overview

Job Type: Full-time

Location: london, United-Kingdom

Posted: June 11, 2026

Deadline: July 21, 2026