Unity is looking for a Senior Machine Learning Infrastructure Engineer to join their Vector Ads team, which builds the real-time systems powering Unity's global advertising platform. The role involves designing and maintaining the infrastructure that serves machine learning models in real-time, ensuring reliability and performance in a high-scale environment.

Responsibilities:

Design, build, and maintain the infrastructure that serves ML models in real-time across Unity's ads ecosystem
Build and operate scalable model serving pipelines — owning latency, throughput, and reliability in a high-QPS production environment
Partner with ML engineers to productionize models, manage model deployments, and improve iteration speed
Improve observability, performance, and cost-efficiency of ML serving infrastructure
Contribute to architectural decisions around feature serving, model versioning, and inference optimization

Requirements:

Experience building and operating ML infrastructure or model serving systems in production
Proficiency in Golang or Python, with strong systems engineering fundamentals
Hands-on experience with Kubernetes and container orchestration at scale
Familiarity with ML serving frameworks such as Ray Serve, Triton, TorchServe, or similar
Understanding of distributed systems, API design, and system reliability
Strong collaboration and communication skills in a remote-first environment
Experience with feature stores, feature pipelines, or online/offline feature serving
Background in ad tech, real-time bidding, or programmatic advertising systems
Familiarity with infrastructure-as-code such as Terraform
Experience with observability tooling — Prometheus, Grafana, OpenTelemetry
Background with real-time data pipelines, caching layers, or low-latency serving systems

Senior Machine Learning Infrastructure Engineer

Key skills

About this role

Responsibilities:

Requirements: