AI Evaluation Engineer

FirstIgnite
Full-time Remote, Remote Ingeniería de calidad
Posted:
June 03, 2026
Location:
Remote, Remote, Mexico

Job Description

The Role

We're hiring an AI Evaluation Engineer to own the quality bar for every LLM-powered feature we ship. You'll design, build, and scale the infrastructure that tells us — with evidence — whether a prompt change, model swap, or agent refactor made things better or worse.

This is a high-leverage role. Every customer-facing AI capability at FirstIgnite flows through your evals. You'll work directly with the Head of Engineering and partner closely with product, applied AI, and the full-stack team to establish evaluation as a first-class discipline across the company.

What You'll Do

  • Build evaluation infrastructure: Design and maintain eval suites using Promptfoo, LLM-as-judge methodologies, and custom harnesses for features like our expert search system, natural language grants search, and AI SDR agents.
  • Define what good means: Partner with product and domain experts to translate fuzzy customer outcomes (does this surface the right p...

Apply for this Job

Submit your application for the AI Evaluation Engineer position at FirstIgnite.

Apply Now Save for Later

Job Overview

Job Type: Full-time
Location: Remote, Mexico
Posted: June 03, 2026
Deadline: July 13, 2026