Evaluation Scenario Writer - AI Agent Testing Specialist

Mindrift
Full-time WorkFromHome, WorkFromHome IT & Technology
Posted:
March 02, 2026
Location:
WorkFromHome, WorkFromHome, Norway

Job Description

Please submit your CV in English and indicate your level of English proficiency.

Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment.

What This Opportunity Involves

  • Review and refine realistic coding tasks based on provided production codebases with realistic scope, requirements and information sources
  • Write comprehensive functional tests that validate actual end-to-end behavior and edge-cases, not just superficial checks
  • Craft fair but hard challenges where the AI has all the context it needs, but has to work for it (information scattered across files and external sources, complex reasoning required)
  • Analyze AI failures to understand what the model struggles with vs. what it masters
  • Iterate based on feedback from expert QA reviewers who score your work o...

Apply for this Job

Submit your application for the Evaluation Scenario Writer - AI Agent Testing Specialist position at Mindrift.

Apply Now Save for Later

Job Overview

Job Type: Full-time
Location: WorkFromHome, Norway
Posted: March 02, 2026
Deadline: April 11, 2026