Support and coordinate the ongoing operational activities (compliance, patching, deployment, monitoring, reporting) for our private and public cloud hosted solutions
Own end-to-end availability and performance of infrastructure platform and build automation to prevent problem recurrence
Manage on-call rotations across continents, using a follow-the-sun model
Design, write and deliver software to improve the availability, scalability, latency, and efficiency of systems
Focus on optimizing existing systems, building infrastructure, and eliminating work through automation
Debug system-level problems in a multi-vendor, multi-protocol network environment with high-level technical expertise on complex issues
Resolve all technical issues, clearly documenting the analysis and customer communication and action items within the RCA post engagement
Collaborate across multiple global engineering teams
Requirements
Bachelor’s degree in computer science or information technology
5+ years of DevOps experience in a large enterprise organization
Proven experience in leading global teams
Excellent communication skills and a sense of ownership, with a systematic problem-solving approach
Solid experience designing, analyzing, and troubleshooting large-scale distributed systems
Proficient knowledge of TCP/IP, HTTPs, SSO-SAML/OICD and SaaS understanding
Solid experience with Infrastructure as a code – CloudFormation, Terraform
Proficient experience supporting large scale cloud environments, including multi-site Kubernetes deployments and working with container technologies, deployment and orchestration (Docker, Kubernetes, Helm)
Prior experience deploying and managing EKS clusters using CloudFormation
Proficient experience with Access Control Policy / RBAC
Proficient experience in Python / Go (Golang) / C++ / Java
Solid experience with Bash or Shell Scripting
Good understanding of Linux/Unix
Solid experience with cloud vendors like AWS, OCI, Azure, private cloud etc.
Skilled in CI/CD pipeline creation and deployment of infrastructure as code in AWS using Cloud Formation, Terraform, knowledge about GitOps / GitOps tooling (e.g ArgoCD)
Proficient experience in SaltStack/Ansible/Chef/puppet
Solid experience with Source Code Management tools (GitHub, Gitlab, SVN) and an understanding of branching and integration processes