Unity is looking for a Senior Machine Learning Infrastructure Engineer to join their Vector Ads team, which builds the real-time systems powering Unity's global advertising platform. The role involves designing and maintaining the infrastructure that serves machine learning models in real-time, ensuring reliability and performance in a high-scale environment.
Responsibilities:
- Design, build, and maintain the infrastructure that serves ML models in real-time across Unity's ads ecosystem
- Build and operate scalable model serving pipelines — owning latency, throughput, and reliability in a high-QPS production environment
- Partner with ML engineers to productionize models, manage model deployments, and improve iteration speed
- Improve observability, performance, and cost-efficiency of ML serving infrastructure
- Contribute to architectural decisions around feature serving, model versioning, and inference optimization
Requirements:
- Experience building and operating ML infrastructure or model serving systems in production
- Proficiency in Golang or Python, with strong systems engineering fundamentals
- Hands-on experience with Kubernetes and container orchestration at scale
- Familiarity with ML serving frameworks such as Ray Serve, Triton, TorchServe, or similar
- Understanding of distributed systems, API design, and system reliability
- Strong collaboration and communication skills in a remote-first environment
- Experience with feature stores, feature pipelines, or online/offline feature serving
- Background in ad tech, real-time bidding, or programmatic advertising systems
- Familiarity with infrastructure-as-code such as Terraform
- Experience with observability tooling — Prometheus, Grafana, OpenTelemetry
- Background with real-time data pipelines, caching layers, or low-latency serving systems