Posted:
June 11, 2026
Location:
london, england, United-Kingdom

Job Description

Requirements

  • Proven experience improving performance in production systems with tight constraints (latency, memory, bandwidth, power/thermal, or cost)
  • Strong proficiency with at least one relevant stack/toolchain (e.g. TensorRT, CUDA, Qualcomm QNN, Triton, OpenCL) and confidence learning adjacent frameworks quickly
  • Comfort operating at multiple levels of abstraction — from high‑level model behaviour down to low‑level kernel/runtime execution
  • Strong software engineering fundamentals (debugging, profiling, testing, and maintainable code)
  • Clear communicator and collaborative teammate; able to align multiple stakeholders on performance trade‑offs and priorities
  • (Desirable) Exposure to embedded or edge deployment of ML models, including benchmarking on real devices and handling system‑level constraints
  • (Desirable) Experience with NVIDIA and/or Qualcomm SoCs and performance tooling
  • (Desirable) Python ...

Apply for this Job

Submit your application for the Staff Machine Learning Performance Engineer (Inference Optimisation) position at Wayve.

Apply Now Save for Later

Job Overview

Job Type: Full-time
Location: london, United-Kingdom
Posted: June 11, 2026
Deadline: July 21, 2026