Cartesia is pioneering AI that learns and interacts like humans, driven by a team of PhDs from the Stanford AI Lab. They are seeking a Human Data Operations Manager to design and operate a global evaluation workforce, impacting model quality and customer outcomes through effective workforce management and operational performance.

Responsibilities:

Design and implement workforce structure across languages, skill tiers, and use cases, including evaluators, auditors, and leads for TTS products
Build capacity models to support continuous eval pipelines and data production workflows
Own relationships with vendors such as data annotation firms and contractor platforms, negotiating rate cards, SLAs, and throughput guarantees
Decide on build, buy, or hybrid workforce models and continuously benchmark cost and performance across regions
Design multi-layer QA systems spanning self-checks, peer review, audits, and gold tasks
Define and track inter-rater reliability, error rates by category, and annotator-level performance distributions
Build escalation and retraining workflows to maintain quality at scale
Run day-to-day operations including task allocation, throughput tracking, and SLA adherence
Build systems to reduce evaluator fatigue, rotate task types, and maintain consistency across large-scale evaluations
Partner with tooling teams to improve evaluator UX and with data teams to ensure clean, structured outputs for model training

Human Data Operations Manager

Key skills

About this role

Responsibilities: