Design, evolve, and operate scalable and elastic cloud architectures for multi tenant SaaS platforms
Continuously challenge and improve existing infrastructure and architectural decisions to remove performance, scalability, and operability bottlenecks
Design and maintain cloud native and hybrid solutions, integrating cloud platforms with on prem systems when required
Build, maintain, and improve CI/CD pipelines that enable fast, safe, and repeatable deployments
Promote and enforce Infrastructure as Code (IaC) practices using Terraform
Automate provisioning, configuration, scaling, and recovery to reduce manual operational effort
Improve deployment strategies in collaboration with SRE teams to increase reliability and predictability
Design and operate containerized platforms using Docker and Kubernetes
Support and evolve microservices architectures, ensuring deployment safety, isolation, and scalability
Operate and support production and pre-production environments and troubleshoot complex infrastructure issues
Participate in incident response and on call rotations when required, working with SREs to reduce operational toil
Maintain clear and up to date documentation for infrastructure, pipelines, and operational procedures
Partner closely with engineering teams to improve developer experience, delivery velocity, and platform reliability
Support other tasks or projects as assigned to meet team and business needs
Requirements
3 or more years of experience as a DevOps Engineer, Platform Engineer, or Senior Software Engineer with infrastructure ownership
Strong hands on experience with cloud platforms, preferably Azure, with AWS or GCP also acceptable
Proven experience with Kubernetes, preferably AKS, Docker, Terraform, CI/CD systems such as Jenkins and Groovy pipelines, Linux based systems, and configuration management tools like Ansible
Strong scripting and automation skills using Python and or Bash
Solid understanding of distributed systems and cloud networking fundamentals
Comfortable working with Git based workflows such as Bitbucket
Excellent written and spoken communication skills in English to collaborate effectively with distributed teams
Experience operating or supporting multi tenant SaaS platforms is a plus
Previous collaboration with SRE teams and exposure to reliability practices would be beneficial
Familiarity with monitoring and observability tools such as Grafana, Datadog, or ELK would be helpful
Hands on exposure to hybrid cloud environments or on-prem integrations would be considered an advantage
A software engineering background with experience owning systems end to end would be positively valued.
Tech Stack
Ansible
AWS
Azure
Cloud
Distributed Systems
Docker
Google Cloud Platform
Grafana
Groovy
Jenkins
Kubernetes
Linux
Microservices
Python
Terraform
Benefits
Work from anywhere
this is a remote opportunity, with a primary hiring focus on candidates located within the European Union, in line with team needs and operational coverage
A competitive salary that values you and your unique skill sets
Career advancement & professional development opportunities to help you reach your full potential
Flexible work arrangements to support work/life balance