Architect is an AI research and product lab for chip design, focused on reimagining chip design using AI. As a Research Intern, you will work alongside the founding team to conduct experiments that influence the core modeling roadmap, specifically in Reinforcement Learning and hardware optimization.
Responsibilities:
- Responsible for co-designing and implementing the Reinforcement Learning experiments (GRPO/PPO/DPO), training data mixes and reward signal explorations
- Contribute to research on post-training techniques, running ablation studies to improve model reasoning and alignment capabilities
- Implement and test new algorithms for model fine-tuning and evaluation, helping to translate research papers into working prototypes
- Analyze experimental results and debug model behavior to help establish best practices for our training recipes