LawZero is a non-profit organization focused on advancing research and creating technical solutions for safe-by-design AI systems. It is seeking a Senior Distributed ML Engineer to collaborate with researchers on large-scale model training and inference in distributed computing environments.
Responsibilities:
- Collaborate with researchers to accelerate research, model training, and inference, and to facilitate the use of large-scale models in distributed computing environments
- Investigate performance bottlenecks, profile research experiment code, debug reported issues, and optimize the utilization of computing resources
- Develop tools and libraries to simplify and orchestrate the use of distributed computing resources for research experiments
- Establish, document, and maintain best practices for large-scale, distributed ML model development workflows