About this role

Anthropic is a public benefit corporation focused on creating reliable and interpretable AI systems. The Research Engineer role within the Reinforcement Learning team involves collaborating with researchers and engineers to advance AI capabilities, implementing novel approaches, and conducting fundamental research in reinforcement learning.

Responsibilities:

Collaborate with a diverse group of researchers and engineers to advance the capabilities and safety of large language models
Implement novel approaches and contribute to the research direction
Work on fundamental research in reinforcement learning, creating 'agentic' models via tool use for open-ended tasks such as computer use and autonomous software generation, improving reasoning abilities in areas such as mathematics, and developing prototypes for internal use, productivity, and evaluation
Architect and optimize core reinforcement learning infrastructure, from clean training abstractions to distributed experiment management across GPU clusters
Help scale our systems to handle increasingly complex research workflows
Design, implement, and test novel training environments, evaluations, and methodologies for reinforcement learning agents which push the state of the art for the next generation of models
Drive performance improvements across our stack through profiling, optimization, and benchmarking
Implement efficient caching solutions and debug distributed systems to accelerate both training and evaluation workflows
Collaborate across research and engineering teams to develop automated testing frameworks, design clean APIs, and build scalable infrastructure that accelerates AI research

Research Engineer, Machine Learning (Reinforcement Learning)

Key skills

About this role

Responsibilities: