Job Description
Our Mission
Reflection's mission is to build open superintelligence and make it accessible to all.
We're developing open weight models for individuals, agents, enterprises, and even nation states. Our team of AI researchers and company builders come from DeepMind, OpenAI, Google Brain, Meta, Character.AI, Anthropic and beyond.
About the Role
-
Drive the entire alignment stack, spanning instruction tuning, RLHF, and RLAIF, to push the model toward high factual accuracy and robust instruction following.
-
Lead research efforts to design next-generation reward models and optimization objectives that significantly improve human preference (HP) performance.
-
Curate high-quality training data and design synthetic data pipelines that solve complex reasoning and behavioral gaps.
-
Optimize large-scale RL pipelines for stability and efficiency, ensuring rapid iteration cycles fo...
Apply for this Job
Submit your application for the Member of Technical Staff position at Reflection.
Apply Now Save for Later