IMPACT Technology Recruiting is representing a client seeking a Senior Site Reliability Engineer (SRE) to support the deployment, maintenance, and operational excellence of cloud-based SaaS solutions. This role is responsible for ensuring platform reliability, performance, scalability, and continuous improvement across cloud infrastructure and delivery processes.
Responsibilities:
- Support the deployment, maintenance, and operational excellence of cloud-based SaaS solutions
- Ensure platform reliability, performance, scalability, and continuous improvement across cloud infrastructure and delivery processes
Requirements:
- Strong communication skills with the ability to discuss technical concepts with both internal teams and customers
- Hands-on experience with AWS and/or GCP cloud environments (90% AWS)
- Experience managing Kubernetes-based platforms
- Strong Infrastructure as Code experience using Terraform
- Configuration management experience with Ansible
- Experience with Helm and containerized application deployments
- Working knowledge of MariaDB and MongoDB
- Experience with monitoring, observability, networking, and cloud security best practices
- 3+ years of experience in Site Reliability Engineering, Cloud Operations, DevOps, or Platform Engineering
- Experience supporting highly available, production-critical environments
- Strong troubleshooting and problem-solving abilities
- Proven experience driving automation and operational improvements
- Ability to thrive in a fast-paced, collaborative environment