Quote.com is a tech-enabled omnichannel performance marketing organization that delivers high-quality, mission-critical demand at scale. They are seeking a Senior DevOps Engineer to shape the future of infrastructure and deployment automation, focusing on improving system reliability and operational efficiency.
Responsibilities:
- Design, implement, and maintain scalable cloud infrastructure primarily within GCP (primary), with additional support for AWS environments
- Develop and maintain Infrastructure-as-Code solutions using Terraform and related automation tooling
- Manage Linux-based infrastructure environments including Ubuntu, Debian, Alpine, and similar distributions
- Build and maintain containerized environments and deployment workflows using Docker
- Contribute to cloud architecture decisions related to scalability, resiliency, performance, and cost optimization
- Build, maintain, and optimize CI/CD pipelines using GitLab and GitHub Actions
- Improve deployment automation, release reliability, rollback strategies, and deployment standards
- Support Git-based workflows including CI/CD validation, deployment approvals, and release management practices
- Partner with engineering teams to improve deployment confidence and developer productivity
- Implement and maintain monitoring, alerting, logging, and observability solutions using modern application performance monitoring tools
- Participate in on-call support and production incident response processes
- Lead root cause analysis efforts and help implement preventative reliability improvements
- Develop operational runbooks and playbooks for deployment procedures, incident response documentation, recovery processes and recurring operational workflows
- Improve operational maturity through standardized monitoring, escalation, and response workflows
- Partner with Tech Leads on infrastructure hardening, compliance initiatives, vulnerability remediation, and cloud security best practices
- Collaborate with Technical Operations and Web Engineering teams to improve platform reliability and operational efficiency
- Identify opportunities for automation and operational improvements across infrastructure and deployment systems
- Maintain clear documentation for systems architecture, operational workflows, and infrastructure standards
Requirements:
- 5+ years of DevOps, Site Reliability Engineering (SRE), Infrastructure Engineering, Platform Engineering, or related experience, with demonstrated senior-level ownership of production infrastructure, deployment automation, cloud reliability, and incident response
- Experience operating independently as a senior individual contributor, lead engineer, or primary DevOps/infrastructure owner in a small company engineering environment
- Strong troubleshooting, systems analysis, and infrastructure debugging skills
- Experience building scalable cloud infrastructure and deployment automation workflows
- Strong understanding of observability, monitoring, incident response, and operational best practices
- Comfortable operating independently as the primary DevOps resource within a small engineering organization
- Strong communication and cross-functional collaboration skills
- Track record of improving operational reliability, deployment processes, and infrastructure scalability
- Hands-on expertise with and deep, demonstrated knowledge of AWS cloud infrastructure
- GitLab CI/CD
- GitHub Actions
- Terraform and Infrastructure-as-Code practices
- Docker
- Linux administration
- New Relic monitoring and observability
- Cloudflare
- CI/CD and release engineering best practices
- Production support and incident response processes
- Kubernetes
- Working knowledge of GCP environments and services
- PHP
- Node.js
- MySQL
- BigQuery
- Experience with Ubuntu, Debian, Alpine, or similar Linux distributions
- Infrastructure documentation, operational runbooks, and deployment standards
- Supporting production systems in high-availability environments
- Experience or familiarity with Ansible or other configuration management tools
- ECS, or container orchestration platforms
- Looker or analytics/reporting platforms
- Python or Bash scripting
- Cloud cost optimization and governance initiatives
- Regulated or security-focused environments