Deliver data processing plans for clinical trials, from design and testing through deployment and maintenance, to meet specific trial protocol requirements
Collaborate with Clinical Operations, Infrastructure teams, Data Scientists, and Data Managers to pull together effective data tooling
Turn your hand to a variety of datasets and transfer methods for multimodal healthcare data, integrating HRCT imaging with clinical metadata
Care about data quality, and ensure the pipelines you build are robust, scalable, and maintainable
Ensure that our data processes have quality and compliance designed in from the start, so that reproducibility, lineage tracking, and data quality checks are painless
Own project data delivery and help shape automation of regulatory compliance
Requirements
Proven experience as a Data Engineer in complex, data-rich environments
Strong programming skills in Python
Experience building and maintaining production data pipelines within regulated environments
Experience enforcing data quality requirements
Experience with automation/orchestration tools such as Dagster, with containerisation, and with cloud infrastructure on AWS
Strong collaboration skills and attention to detail, particularly through understanding the underlying context of the data
Even better if you have experience of...
Medical imaging data such as CT or MRI, and the DICOM format
Deploying in Kubernetes, particularly using GitOps tools such as Flux
Data versioning and reproducibility frameworks
Working in regulated environments such as GxP or ISO 13485, or with clinical trial data
Tech Stack
AWS
Cloud
Flux
Kubernetes
Python
Benefits
A comprehensive benefits package that includes an annual bonus plan, private medical insurance, life insurance, and a contributory pension scheme
25 days of annual leave plus bank holidays, and enhanced maternity leave
A diverse work environment that brings together experts in many fields, including software engineering, DevOps, data science, machine learning, quality assurance, regulatory affairs, and clinical operations