Dayforce is a global human capital management company headquartered in Minneapolis, Minnesota. The company is seeking a Site Reliability Engineer to bridge the gap between Development and Operations teams, enabling faster feature delivery and improving service quality across its cloud infrastructure.
Responsibilities:
- Learn and develop a deep understanding of Dayforce’s cloud infrastructure and application ecosystem
- Onboard new applications and features, ensuring successful completion of Production Readiness Reviews
- Contribute to projects that improve platform reliability, automate operational tasks, and enhance SRE processes
- Participate in incident response activities, root cause investigations, and remediation efforts
- Create and maintain runbooks and reusable operational components
- Contribute to the internal SRE repository and engineering best practices
- Build trusted relationships across Development, Operations, and broader business teams
- Participate in PagerDuty on-call rotations as required
- Support continuous improvement initiatives focused on scalability, performance, and service reliability
Requirements:
- U.S.-based applicants must pass Dayforce's USFedPASS (U.S. Federal Personnel Authorization Screening Standards) background screening, which includes a credit check, criminal/misdemeanor background check, and drug test
- Employment is contingent upon successfully passing the screening
- Due to federal requirements, only U.S. citizens, naturalized U.S. citizens, U.S. permanent residents, or Green Card holders will be considered
- Candidates must also be eligible for U.S. security clearance
- 2–4 years of experience as an SRE, System Administrator, Network Engineer, Database Administrator, or Software Engineer
- Experience with cloud platforms, preferably Microsoft Azure
- Experience with at least one object-oriented programming language, preferably C#
- Experience with scripting languages such as Python or PowerShell
- Experience with at least one database platform and its query language, such as Microsoft SQL Server (T-SQL) or PostgreSQL (PL/pgSQL)
- Strong communication and collaboration skills
- Ability to work effectively in fast-paced operational environments
- Understanding of operational excellence, automation, and reliability engineering concepts
- Experience with containerization technologies and Kubernetes
- Experience with Infrastructure as Code tools such as Terraform
- Experience automating operational workflows and remediation activities
- Familiarity with monitoring, alerting, and observability best practices
- Strong troubleshooting and root-cause analysis skills in distributed cloud environments