MongoDB is a leading company that empowers customers to innovate at the speed of the market. They are seeking a Senior Engineer to help build a next-generation inference platform that supports embedding models for semantic search and AI-native experiences in MongoDB Atlas.
Responsibilities:
- Design and build components of a multi-tenant inference platform integrated directly with MongoDB Atlas, supporting semantic search and hybrid retrieval
- Collaborate with AI engineers and researchers to productionize inference for embedding models and rerankers — enabling both batch and real-time use cases
- Contribute to platform capabilities such as latency-aware routing, model versioning, health monitoring, and observability
- Improve performance, autoscaling, GPU utilization, and resource efficiency in a cloud-native environment
- Work across product, infrastructure, and ML teams to ensure the inference platform meets the scale, reliability, and latency demands of Atlas users
- Gain hands-on experience with tools like vLLM and container orchestration with Kubernetes