Home
Jobs
Saved
Resumes
Senior Site Reliability Engineer at Megaport | JobVerse
JobVerse
Home
Jobs
Recruiters
Companies
Pricing
Blog
Jobs
/
Senior Site Reliability Engineer
Megaport
Remote
Website
LinkedIn
Senior Site Reliability Engineer
Australia
Full Time
3 weeks ago
Visa Sponsorship
Apply Now
Key skills
AWS
Cassandra
Cloud
Kubernetes
Linux
Postgres
Python
Terraform
Go
Bash
GitHub
Version Control
CI/CD
About this role
Role Overview
Improving production reliability and system resilience within an SRE scoped team
Championing high standards of work and industry best practices
Communicating with teams and stakeholders at all stages
Bringing fresh ideas to the table and encouraging others
Diving into complex technical problems with a can-do attitude
Working across numerous technologies in a fast-changing industry
Participating in on-call rotation, incident response, and blameless post-incident reviews
Writing code, handling alerts, improving solutions, and supporting others
Playing a crucial role in the success of your company and team
Requirements
5+ years administering Linux systems and related infrastructure in production environments
A collaborative SRE mindset, with familiarity around SLIs/SLOs/SLAs, error budgets, blast radius, and blameless postmortems
A focus on automation, reducing toil, and preventing problem recurrence
A track record of writing runbooks that work for the broader team, not just yourself
Strong Kubernetes and broader ecosystem fundamentals
Cloud infrastructure experience; AWS strongly preferred and bare-metal is a bonus
Strong tool development
Bash, plus either Python or Go preferred, or similar
Infrastructure-as-code tooling experience
Terraform preferred
CI/CD and version control, GitHub preferred
Database experience
one of Postgres, Cassandra, or ClickHouse preferred
Experience operating a production observability stack (metrics, logs, traces), with an eye for signal over noise
Comfortable working on live production infrastructure, with strong troubleshooting instincts and ownership of incident response
A history of continual professional development
A self-directed style suited to an async, globally distributed team, and comfortable picking up adjacent work when the situation calls for it
Tech Stack
AWS
Cassandra
Cloud
Kubernetes
Linux
Postgres
Python
Terraform
Go
Benefits
Flexible working environments
Birthday Leave
Generous study and training allowance + 5 days paid study leave
Creative, fun, and contemporary workspaces
Motivated team of industry experts and new talent
Celebrated success with ‘Legend’ and ‘Kudos’ Awards
Health and wellness program
Apply Now
Home
Jobs
Saved
Resumes