Own the operational lifecycle of external data partnerships following contract signature.
Manage secure biomedical data transfers using cloud infrastructure and standardized transfer protocols.
Collaborate with internal technical and product teams to define and maintain harmonized data models and metadata standards.
Work closely with engineering and data teams to configure and maintain lightweight ingestion and QC pipelines.
Identify missing data, inconsistencies, corruption, or metadata mismatches and work directly with external partners to resolve issues.
Maintain a centralized “single source of truth” for all incoming datasets, including ingestion status, completeness, QC status, and milestone tracking.
Partner closely with Data Science, Engineering, Legal, and Partnership teams to align operational execution with business and scientific priorities.
Conduct periodic visits to partner hospitals, biobanks, and laboratories to support onboarding.
Requirements
Strong understanding of clinical and biomedical data structures, including real-world data, clinical trial datasets, and multi-omics data modalities.
Proven experience managing data lifecycles in cloud environments, particularly AWS (S3, CLI, access management).
Proficiency in Python or R, along with SQL for querying and transforming datasets.
Demonstrated ability to manage multiple external collaborations and operational workstreams simultaneously.
Comfortable working independently in ambiguous environments.
Bachelor's or Master's degree in Life Sciences, Bioinformatics, Health Informatics, Computer Science, or a related quantitative field.
Tech Stack
AWS
Cloud
Python
SQL
Benefits
A collaborative and mission-driven work environment.
Competitive salary and equity package.
Flexible work arrangements, including remote options.
Opportunities for professional growth and leadership development.
The opportunity to shape the future of biology and AI through groundbreaking work.