Unqork is a company that empowers enterprises to build and manage AI-powered applications. They are seeking a Senior DevOps Engineer to build a next-generation control plane for Kubernetes, improve internal engineering experiences, and enhance observability across their infrastructure.
Responsibilities:
- Build the next-generation control plane that provisions, configures, and manages Unqork's Kubernetes fleet across commercial, government, and edge customer environments, continuing to push toward an architecture that is automated, modular, and built to scale
- Design and deliver self-service infrastructure tooling that enables Ops and Support teams to execute common operational workflows without engineering intervention, shifting operational work left and freeing engineers to build
- Drive observability improvements across the fleet, establishing alerting and instrumentation that produces cleaner signal, enables faster mitigation, and supports deeper root cause analysis when incidents occur
- Improve the internal engineering experience by building faster CI/CD pipelines, better tooling, and paved-road patterns that reduce cognitive load and make the right way to build and ship the obvious way
- Collaborate closely with the Principal Architect to shape technical direction across the control plane, infrastructure automation, and cross-cutting infrastructure concerns
Requirements:
- 5+ years operating production Kubernetes clusters or equivalent container orchestration experience at scale in multi-tenant or regulated environments
- Strong AWS fundamentals (EKS, IAM, VPC networking, ACM) as the primary cloud; working knowledge of GCP and Azure in a multi-cloud operating environment is a plus
- Hands-on IaC experience (Terraform, OpenTofu, or Pulumi); comfort treating infrastructure as software, with real opinions on testability and safe change management at scale
- Strong software engineering skills using general-purpose programming language (Go, TypeScript, Python, or similar) to model and manage the infrastructure lifecycle in an API and workflow driven control plane
- High autonomy in ambiguous problem spaces, defining the path forward when one doesn't exist, and taking full ownership of outcomes rather than waiting to be handed a spec
- A track record of building internal tooling or self-service workflows that non-engineering teams can actually use in production
- Experience improving observability posture, including instrumentation strategy, alert quality, distributed tracing, and reducing time from detection to resolution
- Experience in regulated or compliance-sensitive environments where security, auditability, and change management are non-negotiable
- Bias toward fewer moving parts, not more, and can articulate why every layer of complexity earns its place
- An AI-forward mindset: you actively use AI tools to accelerate your own work and think critically about where automation creates real leverage
- GovCloud or FedRAMP environment experience
- Experience designing infrastructure that supports air-gapped or customer-hosted deployment models