MongoDB is a leading database platform empowering customers to innovate rapidly. As a Staff Technical Program Manager for Site Reliability Engineering, you will partner with SRE leaders to enhance platform reliability, drive program execution, and coordinate cross-functional efforts to ensure smoother launches and stronger reliability metrics.
Responsibilities:
- Drive Program Planning & Execution – Define program scope, milestones, and success criteria with SRE engineers and leaders. Manage dependencies across platform teams, keep work clearly tracked in Jira, and deliver on time.
- Strengthen Production Reliability – Lead change management and launch readiness programs. Partner with SREs and product teams to define and operationalize SLOs/SLIs, and use incident data, metrics, and capacity signals to drive prioritization and continuous improvement.
- Lead Cross-Functional Coordination – Align SRE with Security, Compliance, Cloud Platform, and other engineering teams. Coordinate cross-team incident response, ensure clear follow-through, and build trust as the go-to driver of complex, multi-team efforts.
- Build Scalable Systems & Processes – Design lightweight frameworks and communication patterns that help SRE deliver reliably at scale. Work yourself out of the "hero" role by leaving teams better equipped to execute independently.
Requirements:
- 8+ years in technical program management, engineering management, or a comparable technical role partnering with software engineering teams
- Proven track record leading large-scale, cross-team platform initiatives through ambiguity and change
- Strong knowledge of production change management, the software development lifecycle, and reliability metrics (SLOs, SLIs)
- Skilled at shaping roadmaps and managing dependencies
- Able to query and interpret metrics, logs, or other data sources to inform decisions and communicate risk
- Excellent communicator (clear, concise, and calm) with engineers, cross-functional partners, and executives
- Low-ego, highly collaborative, and motivated by end-to-end ownership of hard problems
- Hands-on or close-partner experience with Kubernetes, cloud networking, or observability stacks (metrics, logs, tracing, alerting)
- Prior experience working with or alongside SRE teams
- Background in large-scale cloud infrastructure or platform engineering
- Familiarity with MongoDB Atlas or other modern cloud database platforms