Human Data Evals Lead (Remote/US/LATAM)

remotesenior

via Ashby

See if I'm a fit →Tailor my resume for this role →Apply on Ashby ↗

About this role

Reports to: CEO Owns: data proposals, sample development, quality, and pilot delivery Location: Remote / Latam / US THE ROLE You will own Anyone AI’s data initiatives and proposals to AI labs, from the data proposal or responding to requests, through pilot delivery. You own how we build proposals and develop the sample packages and benchmarks: frontier-grade packages across reasoning, coding, agents, and tool use, multi-modal and others, produced in collaboration with subject-matter experts, with expert-verified ground truth, multi-model headroom results, and QC that survives buyer-side scrutiny. You are the person who designs the sample that demonstrates our quality, converts pilots into production engagements.…

Read the full description on Anyone-ai's site →

What we'd score you on

reqspace match rubric

Five dimensions, recruiter-grade. Upload your resume and we'll generate a written explanation of where you fit and where the gaps are.

Skills match

For this role: modal, slack, teams

Level fit

This role is senior-level. We check your trajectory against it.

Domain experience

Your work in the role's domain matters more than your years total. We weight recent and direct experience.

Recency

A skill you used last quarter weighs more than one from five years ago. We grade on recency, not lifetime.

Location fit

This role is remote-eligible — we factor in your stated location and time-zone overlap.

Score yourself on this role.

Free · no card · written explanation included

See if I'm a fit →

Skills in this role

Pulled from the job description. These are the keywords we'll weight when scoring your fit.

modalslackteams

More at Anyone-ai

See all open jobs at Anyone-ai →