MongoDB is a leading database platform empowering customers to innovate rapidly. As a Staff Technical Program Manager for Site Reliability Engineering, you will partner with SRE leaders to enhance platform reliability, drive program execution, and coordinate cross-functional efforts to ensure smoother launches and stronger reliability metrics.

Responsibilities:

Drive Program Planning & Execution – Define program scope, milestones, and success criteria with SRE engineers and leaders. Manage dependencies across platform teams, keep work clearly tracked in Jira, and deliver on time
Strengthen Production Reliability – Lead change management and launch readiness programs. Partner with SREs and product teams to define and operationalize SLOs/SLIs, and use incident data, metrics, and capacity signals to drive prioritization and continuous improvement
Lead Cross-Functional Coordination – Align SRE with Security, Compliance, Cloud platform, and other engineering teams. Coordinate cross-team incident response, ensure clear follow-through, and build trust as the go-to driver of complex, multi-team efforts
Build Scalable Systems & Processes – Design lightweight frameworks and communication patterns that help SRE deliver reliably at scale. Work yourself out of the "hero" role by leaving teams better-equipped to execute independently

Requirements:

8+ years in technical program management, engineering management, or a comparable technical role partnering with software engineering teams
Proven track record leading large-scale, cross-team platform initiatives through ambiguity and change
Strong knowledge of production change management, software development lifecycle, and reliability metrics (SLOs, SLIs)
Skilled at shaping roadmaps and managing dependencies
Able to query and interpret metrics, logs, or other data sources to inform decisions and communicate risk
Excellent communicator—clear, concise, and calm—across engineers, cross-functional partners, and executives
Low-ego, highly collaborative, and motivated by ownership of hard problems end to end
Hands-on or close-partner experience with Kubernetes, cloud networking, or observability stacks (metrics, logs, tracing, alerting)
Prior experience working with or alongside SRE teams
Background in large-scale cloud infrastructure or platform engineering
Familiarity with MongoDB Atlas or other modern cloud database platforms

Staff Technical Program Manager, Site Reliability Engineering

Key skills

About this role

Responsibilities:

Requirements: