Develop custom ingestion pipelines from third-party APIs, webhooks, and SDKs.
Migrate existing data ingestion pipelines from SaaS tools to our own infrastructure.
Deliver data to the raw layer of our Snowflake warehouse with schema consistency, idempotency, and resilience to malformed or late data.
Evolve the existing AWS streaming stack (Kinesis, SQS, Lambda, API Gateway), improving cost efficiency, observability, and resilience.
Maintain infrastructure as code (Terraform) for ingestion and processing resources.
Monitor pipelines, troubleshoot issues, and resolve incidents.
Contribute to our ongoing work on safe data access for AI agents and tooling.
Continuously improve ingestion architecture, tooling, and engineering practices.
Requirements
5+ years of commercial Python experience in a backend or data engineering role.
Solid understanding of distributed systems architecture, including load balancing, sharding, and trade-offs between synchronous and asynchronous processing.
Hands-on experience with cloud services, preferably AWS (Lambda, API Gateway, containers in ECS/EKS).
Experience building streaming pipelines (Kinesis, SQS, or equivalents).
Experience with infrastructure as code (Terraform or equivalents).
Working knowledge of SQL: ability to read and write queries, including joins and window functions.
Commercial experience with real-world production data: ability to design systems that gracefully handle messy, duplicated, delayed, or malformed input.
Solid understanding of how data flows through production systems, including the ability to anticipate and prevent downstream issues.
Operational mindset: comfortable owning production systems and monitoring them.
Ability to make architectural decisions independently and review the work of teammates.
Nice to Have
Strong SQL and data modeling skills.
Experience with Snowflake and dbt.
Experience with workflow orchestrators (Airflow, Dagster, Prefect, or equivalents).
Experience working with analytics and product teams as a data provider.
Experience working with sensitive or regulated data.
Tech Stack
Airflow
AWS
Cloud
Distributed Systems
Python
SQL
Terraform
Benefits
Open-minded teams, a welcoming and inclusive company culture, plus the opportunity to make a real difference with a game-changing health tech product.
A competitive salary package based on your unique expertise, skillset, and impact on the product, plus stock options.
In-office, remote, and hybrid work opportunities.
Whatever equipment you need to be happy and productive.
A premium SIMPLE subscription.
21 days of annual leave, plus bank holidays (those observed where you live).
Flexible hours. We focus on your results, not how long you spend at your desk.