San Francisco, California, United States of America
Full Time
6 days ago
$145,200 - $196,400 USD
No Visa Sponsorship
Key skills
PythonAIMLNLPGenAILLMRAGAgenticA/B Testing
About this role
Role Overview
Design and evaluate information access + reasoning strategies across RAG, agents, and classic ML: chunking, embedding models, hybrid search, metadata filtering, semantic routing
Prototype GenAI workflows (including agentic systems) that map and reason over compliance objects (controls ↔ risks ↔ requirements ↔ evidence)
Explore ML + probabilistic approaches where GenAI is not the best fit: classifiers, ranking models, graph/link prediction, calibration, and structured prediction
Build and maintain evaluation frameworks: golden datasets, automated quality metrics, regression detection