OneOncology is positioning community oncologists to drive the future of cancer care through a patient-centric, physician-driven, and technology-powered model. The Lead Data Engineer will lead teams responsible for data operations and analytics engineering, ensuring the reliability and performance of the data platform.

Responsibilities:

Lead, mentor, and develop engineers across both the Data Operations and Analytics Engineering teams
Contribute to a culture of clear goals, open feedback, and continuous growth across the team
Promote collaboration and accountability while driving high standards for quality, efficiency, and continuous improvement
Lead incident management and resolution for pipeline failures, cluster issues, and data quality problems, driving thorough root cause analysis and preventative improvements
Define and enforce operational standards, runbooks, and on-call practices for the team
Manage and maintain Databricks Workflows and job orchestration, ensuring SLAs are consistently met
Oversee the design, development, and maintenance of data models and transformations that serve business intelligence and analytics use cases
Define and enforce analytics engineering best practices including modular transformation patterns, data testing, and code review standards
Partner with data analysts and business stakeholders to understand modeling requirements and ensure data is accurate, accessible, and well-understood
Additional responsibilities as assigned to help drive our mission of improving the lives of everyone living with cancer

Requirements:

8+ years of hands-on experience with SQL development
8+ years of experience working with relational and non-relational databases, with a strong foundation in data modeling, schema design, and query optimization
5+ years of professional experience developing scalable solutions using Python or a similar OO language
Proficient in Databricks, Spark, and Delta tables; experience with large-scale distributed data processing preferred
Hands-on experience operating and monitoring data pipelines at scale in a production environment
Solid understanding of the Lakehouse and Medallion architectures
Experience with Azure data services (ADLS Gen2, Azure Data Factory, Event Hubs, or equivalent)
Familiarity with Unity Catalog for data governance, access control, and data lineage
Proven experience with designing Data Integration/ETL pipelines, using such tools as Azure Data Factory or equivalent
Excellent communication skills with the ability to convey technical concepts and operational status to both technical and non-technical stakeholders
Experience with large-scale distributed data processing

Lead Data Engineer

Key skills

About this role

Responsibilities:

Requirements: