Am
Systems Development Engineer, Research Compute Platform, Fauna
Amazon
Full-time
New York, NY
other-general
Posted:
June 13, 2026
Location:
New York, NY, United States
Job Description
Description
We are seeking a Systems Development Engineer to own the research compute platform for Fauna Robotics. You will build and operate the physical and virtual infrastructure that our ML scientists use to train reinforcement learning policies for real robots, from fleet provisioning and job scheduling to cloud burst capacity and environment reproducibility.
This role requires both strong systems engineering fundamentals and genuine comfort working alongside researchers. The ideal candidate is as happy diagnosing a GPU thermal fault as they are designing a job scheduler, and treats “the scientist’s training run just works” as the north star for everything they build.
Key job responsibilities
- Own on-prem GPU compute end-to-end: provisioning, imaging, driver and CUDA management, monitoring, failure diagnosis, hardware RMA, and capacity planning
- Build and operate a job scheduling layer (Slurm, Ray, SkyPilot, or equivalent) so scientists submit tr...
We are seeking a Systems Development Engineer to own the research compute platform for Fauna Robotics. You will build and operate the physical and virtual infrastructure that our ML scientists use to train reinforcement learning policies for real robots, from fleet provisioning and job scheduling to cloud burst capacity and environment reproducibility.
This role requires both strong systems engineering fundamentals and genuine comfort working alongside researchers. The ideal candidate is as happy diagnosing a GPU thermal fault as they are designing a job scheduler, and treats “the scientist’s training run just works” as the north star for everything they build.
Key job responsibilities
- Own on-prem GPU compute end-to-end: provisioning, imaging, driver and CUDA management, monitoring, failure diagnosis, hardware RMA, and capacity planning
- Build and operate a job scheduling layer (Slurm, Ray, SkyPilot, or equivalent) so scientists submit tr...
Apply for this Job
Submit your application for the Systems Development Engineer, Research Compute Platform, Fauna position at Amazon.
Apply Now Save for LaterJob Overview
Job Type:
Full-time
Location:
New York, United States
Posted:
June 13, 2026
Deadline:
June 18, 2026