Cohere is on a mission to scale intelligence to serve humanity by training and deploying frontier models for AI systems. They are seeking a Staff Software Engineer to join their Model Serving team, responsible for developing and operating the AI platform that delivers large language models through API endpoints.
Responsibilities:
- Developing, deploying, and operating the AI platform delivering Cohere's large language models through easy to use API endpoints
- Work closely with many teams to deploy optimized NLP models to production in low latency, high throughput, and high availability environments
- Interface with customers and create customized deployments to meet their specific needs