AI Evaluation & Data Engineer for LLM Metrics

Net2Source (N2S)
Full-time región centro, jalisco Bases de datos, analítica y BI
Posted:
June 05, 2026
Location:
región centro, jalisco, Mexico

Job Description

We are looking for AI Evaluation & Data Engineering Specialists to design, curate, and operationalize datasets and evaluation frameworks for AI product performance assessment.

This role involves working with large language models (LLMs), human raters, and automation tools to measure model accuracy, correctness, and usability.

Key Responsibilities

Develop and apply data labeling and scoring guidelines based on Google’s evaluation framework.

Implement LLM-judge calibration workflows to align automated and human evaluations.

Perform error analysis, drift detection , and regression testing of AI model outputs.

Collaborate with automation engineers to integrate datasets into evaluation pipelines.

Support rater training , inter-rater reliability checks, and dataset validation reviews.

Manage data quality assurance and documentation for contributions to Google-maintained repos...

Apply for this Job

Submit your application for the AI Evaluation & Data Engineer for LLM Metrics position at Net2Source (N2S).

Apply Now Save for Later

Job Overview

Job Type: Full-time
Location: región centro, Mexico
Posted: June 05, 2026
Deadline: July 15, 2026