Freelance AI Evaluation Architect
Well-known company
Full-time
Alto Hospicio, Alto Hospicio
Other-General
Posted:
June 15, 2026
Location:
Alto Hospicio, Alto Hospicio, Chile
Job Description
This confidential company connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment.
What this opportunity involves
- We're building a dataset to evaluate AI coding agents: how well a model handles real-world developer tasks.
- You’ll create challenging tasks and evaluation criteria within realistic simulated environments.
- Build virtual companies following a high-level plan: codebase, infrastructure, and context (conversations, documentation, tickets) that together form a realistic environment with development history.
- Assemble and calibrate tasks from intermediate states of the virtual company: craft the prompt, define evaluation criteria, and ensure the task is solvable and the evaluation is fair.
- Design tasks set in isolated environments — emulations of a developer's workstation: a Linux machine with developmen...
Job Overview
Job Type:
Full-time
Location:
Alto Hospicio, Chile
Posted:
June 15, 2026
Deadline:
July 25, 2026