Freelance Agent Evaluation Engineer

Mindrift
Full-time | Buenos Aires, Espírito Santo | Quality Engineering
Posted: June 01, 2026
Location: Buenos Aires, Espírito Santo, Brazil

Job Description

Please submit your CV in English and indicate your level of English proficiency.

What This Opportunity Involves

  • Build virtual companies from a high-level plan: the codebase, infrastructure, and context (conversations, documentation, tickets) that together form a realistic environment with a development history
  • Assemble and calibrate tasks from intermediate states of the virtual company: craft the prompt, define evaluation criteria, and ensure the task is solvable and the evaluation is fair
  • Design tasks set in isolated environments that emulate a developer's workstation: a Linux machine with development tools (terminal, CLI), MCP servers (repository, task tracker, messenger, documentation, etc.), and a real web application codebase
  • Write tests that accept all correct solutions and reject incorrect ones: neither too strict (breaking on valid approaches) nor too lenient (passing bad ones)
  • Iterate with an AI agent on tests - ...
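
As an illustration of the test-writing point above, here is a minimal Python sketch of how a too-strict, a too-lenient, and a fair check differ. Everything in it is an assumption made for the example, not Mindrift's actual tooling: the task ("return the unique items of a list in first-seen order"), the candidate solutions, and all function names are hypothetical.

```python
# Hypothetical task: return the unique items of a list, keeping first-seen order.

def solution_loop(items):
    """Valid approach 1: track seen items in a set."""
    seen, out = set(), []
    for x in items:
        if x not in seen:
            seen.add(x)
            out.append(x)
    return out

def solution_dict(items):
    """Valid approach 2: rely on dict insertion order."""
    return list(dict.fromkeys(items))

def wrong_sorted(items):
    """Incorrect: removes duplicates but loses the original order."""
    return sorted(set(items))

def wrong_noop(items):
    """Incorrect: does not remove duplicates at all."""
    return list(items)

def behavior_check(candidate):
    """Fair check: compares observable behavior only, not implementation."""
    cases = [
        ([], []),
        ([3, 1, 3, 2, 1], [3, 1, 2]),
        (["b", "a", "b"], ["b", "a"]),
    ]
    return all(candidate(list(inp)) == expected for inp, expected in cases)

def too_lenient_check(candidate):
    """Too lenient: only checks the number of items, so wrong order passes."""
    data = [3, 1, 3, 2, 1]
    return len(candidate(list(data))) == len(set(data))

def too_strict_check(candidate):
    """Too strict: also demands that the code references `set`,
    which rejects the equally valid dict-based approach."""
    return "set" in candidate.__code__.co_names and behavior_check(candidate)

if __name__ == "__main__":
    candidates = [solution_loop, solution_dict, wrong_sorted, wrong_noop]
    for fn in candidates:
        print(fn.__name__,
              "fair:", behavior_check(fn),
              "lenient:", too_lenient_check(fn),
              "strict:", too_strict_check(fn))
```

In this sketch only the behavior-based check accepts both valid solutions and rejects both incorrect ones; the other two checks each fail in one of the directions the bullet warns about.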

Job Overview

Job Type: Full-time
Location: Buenos Aires, Brazil
Posted: June 01, 2026
Deadline: July 11, 2026