Upstart is an AI lending marketplace dedicated to reducing the cost and complexity of borrowing for Americans. The Senior DevOps Engineer will evolve the Ephemeral Infrastructure platform, focusing on Kubernetes-based environments and automation to enhance software development efficiency and reliability.
Responsibilities:
- Design, build, and operate Kubernetes-based ephemeral environments that enable engineers to develop, test, and validate software efficiently
- Improve the reliability, scalability, performance, and usability of the Ephemeral Infrastructure platform through automation and platform enhancements
- Partner with product engineering, platform, security, and reliability teams to integrate infrastructure capabilities and improve developer workflows
- Build infrastructure automation, tooling, and self-service capabilities that reduce operational toil and accelerate software delivery
- Enhance observability, incident response, and operational practices to improve platform health and engineer productivity
- Contribute to the long-term architecture and technical direction of Upstart’s developer platform and cloud infrastructure ecosystem
Requirements:
- Bachelor's degree in Computer Science, Engineering, Mathematics, or a related field (or equivalent practical experience), plus 4+ years of software engineering experience
- 4+ years of experience designing, deploying, and operating production Kubernetes environments
- Experience building and operating cloud infrastructure on AWS, including services such as EKS, EC2, IAM, and networking components
- Experience with Kubernetes operators, controllers, and cloud-native platform architecture
- Experience developing software and automation using Go or a comparable programming language
- Experience implementing infrastructure-as-code, CI/CD pipelines, and automated operational workflows in production environments
- Certified Kubernetes Administrator (CKA), Certified Kubernetes Application Developer (CKAD), or equivalent certification
- Experience with Terraform, Helm, GitOps practices, and tools such as ArgoCD
- Experience operating distributed systems with a focus on observability, reliability, and incident response
- Ability to influence platform adoption and collaborate effectively across multiple engineering teams
- Experience building internal developer platforms, ephemeral environment solutions, or developer productivity tooling