Netflix is a leading entertainment company focused on pushing the boundaries of storytelling through technology. They are seeking a Senior Software Engineer to develop and expand their model serving infrastructure for AI/ML applications, enabling innovation and supporting the growing AI needs across the organization.
Responsibilities:
- Develop and expand compute infrastructure to support growing AI needs
- Enable application of ML in new business areas
- Drive AI/ML innovation across Netflix
- Partner with other engineers, product managers, machine learning engineers, and data/research scientists
Requirements:
- You have experience building high-traffic distributed services and infrastructure for online ML model inference, and you are familiar with supporting large-scale ML models with a focus on high availability and performance
- You understand scalable model-serving solutions for generative models and LLMs, have skills in reducing latency and costs, and can resolve bottlenecks to streamline research-to-production workflows
- You are proficient in object-oriented programming (preferably Java) and demonstrate engineering excellence in production hosting, including performance tuning, deployment management, and capacity planning
- You are familiar with deploying ML models using tools such as Triton Inference Server, TensorRT, and Docker
- You have experience working with a public cloud platform such as AWS, Azure, or GCP
- You are a proactive communicator who promotes best practices in observability and logging
- You have a BS/MS in Computer Science, Applied Math, Engineering, or a related field