Button is building the future of the commerce-powered internet, and they are seeking a Senior DevOps Engineer to join their Infrastructure team. The role involves building, maintaining, and evolving Button’s platform to ensure it is scalable, stable, and operable, while partnering with engineering teams to support product development and infrastructure needs.
Responsibilities:
- Expand our system instrumentation and tooling with monitoring, alerting, logging, and tracing for our critical business tasks; you will be responsible for identifying and following through on key system metrics
- Build, improve, maintain, and otherwise support business-critical systems
- Support new feature development as the go-to-partner for Product Engineering for the infrastructure and data needs, providing tools and guidance when it comes to best practices and solving problems with our unique constraints
- Manage and monitor most aspects of our production serving environment. We're an AWS shop, and we make heavy use of ECS, RDS, and EC2 in production, all managed through Terraform
Requirements:
- 5+ years of experience supporting and building infrastructure with direct hands-on experience with a variety of tools and frameworks
- Experience with EC2, RDS, Aurora, and ECS, all managed through Terraform
- Experience in event-driven and queue-driven architecture, serverless, DynamoDB, and step functions
- Experience with Docker
- Experience with build technologies such as Make / Pantsbuild / Bazel / Buck2
- Experience administering and scaling monitoring and observability solutions (e.g., Grafana, Prometheus, Datadog, New Relic, or similar)
- Proficiency with CI/CD solutions
- Proficiency with AWS
- Fluency in and around Linux systems
- A security- and safety-oriented mindset
- An ability to move fast, make decisions, and take a pragmatic approach to any problem
- A track record of 'leveling-up' the team around them, driving impact not just through their own contributions but also by elevating others
- Experience and comfort in a production environment
- Experience with Python, Go, or Node.js
- Experience with GCP
- Experience in AWS CDK