Dice is seeking a DevSecOps & Site Reliability Engineering (SRE) Technical Director to lead the secure delivery and reliability engineering of a mission-critical VA cloud platform. The role involves providing senior technical leadership for all DevSecOps practices and SRE services while ensuring CI/CD pipelines are secure, automated, and compliant.
Responsibilities:
- Maintains regular communication with the Contracting Officer's Representative (COR) and Government technical leadership regarding platform reliability, deployment activities, and operational improvements
- Provides senior technical oversight for all DevSecOps and SRE activities ensuring platform delivery velocity, security posture, and operational resilience meet or exceed contractual performance objectives
- Governs CI/CD pipeline architecture across all supported applications
- Ensures 100% of pipelines integrate automated security testing (SAST/DAST/SCA/container scanning) and enforce environment segregation
- Leads SRE practices including 24/7 on-call coverage governance, Golden Signal monitoring, Service Level Indicators/Objectives (SLI/SLO) definition, and incident response for all assigned applications
- Ensures 100% of infrastructure components are provisioned and managed through approved IaC tooling
- Enforces code reviews and approval cycle time of no more than 3 business days
- Oversees implementation of advanced deployment strategies (e.g., blue/green, canary, rolling) with automated rollback mechanisms to minimize deployment risk to production services
- Ensures all DevSecOps and SRE practices comply with VA security, privacy, and RMF requirements
- Supports ATO and continuous authorization activities
- Collaborates with leadership to align platform delivery, reliability, and observability strategies
- Continuously evaluates and implements improvements to automation, security integration, and deployment efficiency
- Improves DevSecOps metrics across the program
Requirements:
- 8 years of experience in DevSecOps, platform engineering, site reliability engineering, and technical leadership
- Bachelor's Degree in computer science, software engineering, information technology or related field
- Expert experience providing technical leadership for DevSecOps and platform engineering activities supporting complex enterprise systems in a SAFe environment
- Expert experience in cloud-native architectures, containerized environments, CI/CD pipelines, monitoring, and automated infrastructure management
- Expert experience designing, implementing, and governing enterprise CI/CD pipeline architectures (e.g., Jenkins, GitLab CI, Tekton) in a Kubernetes/Elastic Kubernetes Service (EKS) environment
- Expert knowledge of Infrastructure as Code (IaC) using Terraform
- Expert ability to integrate Static Application Security Testing (SAST), Dynamic Application Security Testing (DAST), Software Composition Analysis (SCA), container scanning, and IaC scanning into delivery pipelines
- Excellent experience enforcing secure software supply chain practices and zero-trust principles
- Excellent experience enforcing IaC governance, automated testing, and policy-as-code across all environment types
- Excellent experience with SRE practices (e.g., Service Level Indicators, objectives, agreements, error budget management, capacity planning, toil reduction via automation)
- Excellent knowledge of advanced deployment strategies (e.g., blue/green, canary, rolling deployments) with automated rollback in Kubernetes environments
- Excellent understanding of observability practices (e.g., logging, metrics, distributed tracing and integration with enterprise monitoring platforms (e.g., Dynatrace, Splunk))
- Above average experience supporting Authority to Operate (ATO) processes, continuous authorization, and audit activities in a Federal Risk management Framework (RMF) environment
- Above average knowledge of PI Planning, DevSecOps maturity models, and DevOps metrics (e.g., deployment frequency, change failure rate, MTTR)
- Working knowledge of VA OI&T security policies, FISMA compliance requirements, and NIST 800-53 control implementation in DevSecOps contexts
- Experience supporting Federal Government programs and large-scale mission-critical applications operating in cloud or hybrid environments
- Excellent written and verbal communication skills
- Active Federal Civilian Public Trust clearance
- U.S. Citizenship or Permanent Resident that has lived in the United States for at least 3 years