Implement extraction and incremental load strategies for high-volume relational databases
Build and evolve models in dbt
Write quality tests and keep documentation up to date
Read, refactor, and optimize complex SQL queries to improve performance
Work with Docker and Linux environments
Follow a Git workflow with Pull Requests and ongoing code reviews with the team
Develop custom extractors or use ingestion tools such as dlt, orchestrate pipelines with Dagster or Airflow, and load data into analytical databases like ClickHouse.
Requirements
Solid Python skills for data pipelines (2+ years)
Advanced SQL — reading, refactoring, and optimizing complex queries
dbt — hands-on experience building models, tests, and documentation
Experience with high-volume relational databases and extraction/incremental load strategies
Docker and comfort with Linux environments
Git and workflow with PRs and code review
Plus: ingestion tools like dlt or building custom extractors
Orchestrators (Dagster, Airflow)
ClickHouse or other analytical databases
Data catalog and governance tools (e.g., OpenMetadata)