Job Description
We are seeking an experienced GPU Performance Engineer with a strong background in Python and large-scale model training. In this role, you will design and implement improvements to our training infrastructure and directly contribute to technical decisions that optimize performance of our models. You will also work on post-training processes, including reinforcement learning and fine-tuning. Furthermore, you will contribute to improving the efficiency and scalability of our model serving infrastructure.
Ideal Experience
Strong engineering skills with fluency in Python and PyTorch (or other frameworks).
Proven experience implementing and training large deep learning models.
Experience writing and debugging low-level GPU code (CUDA, C++).
Experience scaling up GPU jobs using large-scale compute clusters (e.g., Slurm or Kubernetes).
Demonstrated ability to analyze and optimize the performance of G...
Apply for this Job
Submit your application for the Member of Technical Staff position at Reka.
Apply Now Save for Later