Shopify is a company focused on enabling entrepreneurship and creating value for the world. They are seeking experienced Site Reliability Engineers to help build and scale their platform, ensuring resilience and performance for millions of merchants.
Responsibilities:
- Help Shopify run its planet scale systems by enabling our engineering teams to create resilient systems
- Build and improve tools to keep our platform resilient and performant
- Ensure we never fail for the same reason twice
- Go on-call and respond to automated alerts and execute playbooks
- Directly impact production systems underpinning commerce for millions of merchants, who generate revenue for their livelihood, their families, and their employees, through the businesses they’ve built on our platform
- Identify gaps in our processes and build or improve tools to support incident management
- Develop production tooling and services to improve our platform’s resilience
- Clean up the noise in our signals, ensuring we can get an understanding of our platform and more efficiently debug problems
Requirements:
- Experience as a Site Reliability Engineer or software engineer
- Ability to build and scale robust and performant systems
- Experience with incident management and developing production tooling
- Willingness to go on-call and respond to automated alerts
- Ability to work within the geographic requirements of EMEA, available and on-call from 0800 UTC - 1400 UTC during on-call weeks
- Commitment to developing and mastering your craft
- Ability to thrive in a fast-paced, changing environment
- Critical thinking and opinion
- Ability to work digital-first