Optum is a global organization that delivers care, aided by technology to help millions of people live healthier lives. The Senior DevOps Engineer will design and implement CI/CD pipelines, ensure production reliability, and drive cloud modernization initiatives while collaborating with cross-functional teams to enhance data pipelines and analytics platforms.
Responsibilities:
- Design, implement, and operate CI/CD pipelines supporting application, data, and platform deployments across Azure and Databricks environments
- Own production reliability, availability, performance, and scalability for cloud platforms, including Databricks workspaces, jobs, clusters, and workflows
- Build and maintain Infrastructure as Code (IaC) and configuration management to provision and manage Databricks, cloud infrastructure, and networking in a repeatable and secure manner
- Automate Databricks environment management, including workspace configuration, cluster policies, job orchestration, and access controls
- Implement and enhance monitoring, alerting, and observability across cloud and Databricks platforms using Splunk, Azure Monitor, and telemetry frameworks
- Partner with data engineering, platform, security, and product teams to enable reliable, compliant, and scalable data pipelines and analytics platforms
- Drive cloud and platform modernization initiatives, including containerization, platform standardization, and Databricks best practices
- Embed AI‑assisted DevOps practices, leveraging GenAI tools to accelerate troubleshooting, automate operational tasks, improve deployment reliability, and optimize system performance
- Enable AIOps capabilities such as intelligent alerting, anomaly detection, log analysis, and predictive insights for proactive operations across Databricks workloads
- Support production issue triage and resolution, including on‑call support, incident management, and post‑incident root cause analysis
- Ensure security‑by‑design and compliance across pipelines and platforms, including secrets management, RBAC, audit readiness, and governance
- Continuously reduce operational toil and improve delivery velocity through automation, AI‑driven insights, and self‑service tooling
- Design, develop, and deploy AI-powered solutions to address complex business challenges with emphasis on responsible use of AI
Requirements:
- Bachelor's degree in Computer Science, IT or Engineering related field
- 5+ years of experience in DevOps, Platform Engineering
- 5+ years of experience with CI/CD tools such as Git, Jenkins, or equivalent enterprise platforms
- 3+ years of experience with .NET, Angular and Typescript
- 3+ years of proven experience operating and supporting any one Azure public cloud infrastructure
- 3+ years of experience in scripting or programming languages (Python, Shell, or similar) with a strong automation mindset
- 3+ years of experience in Infrastructure as Code (IaC): Terraform‑based, repeatable environment provisioning
- 1+ years of hands‑on experience with Databricks platform operations, including clusters, jobs, workflows, and environment configuration
- Experience with Infrastructure as Code, automation, and configuration management practices
- Experience supporting mission‑critical data and analytics platforms, including incident management and RCA
- Experience with containerization and orchestration technologies (Docker, Kubernetes, AKS)
- Experience operating in security‑ and compliance‑driven environments, including RBAC, audit controls, and governance
- Prior experience mentoring engineers or acting as a technical lead within DevOps or platform initiatives
- Solid understanding of monitoring, logging, and observability in large‑scale production environments
- Strong communication skills and ability to collaborate with cross‑functional engineering teams
- Exposure to AI‑enabled DevOps or AIOps, including automated remediation, intelligent monitoring, or predictive analytics
- Familiarity with GenAI tools for AI‑assisted troubleshooting, pipeline optimization, or operational insights