Senior Software Engineer – LLM Evaluation
Nexus ConsultingJob Description
- Title:
Senior Software Engineer – LLM Evaluation (Remote) - Engagement:
Hourly contract (independent contractor) - Location:
Remote
About the Opportunity
One of our global AI research clients is building advanced evaluation and training datasets to improve large language models on realistic software engineering tasks. This project focuses on creating verifiable software engineering challenges derived from public repository histories using a structured, human-in-the-loop approach. The goal is to expand dataset coverage across programming languages, complexity levels, and real-world development scenarios.
Role Overview
We are seeking experienced, tech lead–level software engineers who are comfortable working with high-quality public GitHub repositories (500+ stars). This role combines hands-on engineering work with AI model evaluation, contributing directly to how AI systems interact with real-world codebases.
Wh...
Apply for this Job
Submit your application for the Senior Software Engineer – LLM Evaluation position at Nexus Consulting.
Apply Now Save for Later