Vals AI is focused on building the measurement layer for the AI economy, and they are seeking a Head of Research to lead their efforts in developing methodologies for evaluating AI systems. The role involves advancing the science of evaluation, overseeing research projects, publishing impactful work, and collaborating with enterprise customers.
Responsibilities:
- Advance the science of evaluation. The methodologies the field uses today — judge models, human-in-the-loop, static benchmarks — were built for a previous generation of models and break down on long-horizon, real-world tasks. You'll develop the new paradigms
- Oversee Vals' broader research portfolio, setting direction across the projects already underway and the ones we haven't started yet
- Publish work that moves the field forward. We want Vals' research to be cited, not just shipped
- Recruit and grow a research team alongside the founders
- Work directly with our enterprise customers and lab partners on the evaluation problems they actually have