Develop and maintain data pipelines in Python
Implement extraction and incremental load strategies for high-volume relational databases
Build and evolve models in dbt
Write quality tests and keep documentation up to date
Read, refactor, and optimize complex SQL queries to improve performance
Work with Docker and Linux environments
Follow a Git workflow with Pull Requests and ongoing code reviews with the team
Develop custom extractors or use ingestion tools such as dlt, orchestrate pipelines with Dagster or Airflow, and load data into analytical databases like ClickHouse.

Solid Python skills for data pipelines (2+ years)
Advanced SQL — reading, refactoring, and optimizing complex queries
dbt — hands-on experience building models, tests, and documentation
Experience with high-volume relational databases and extraction/incremental load strategies
Docker and comfort with Linux environments
Git and workflow with PRs and code review
Plus: ingestion tools like dlt or building custom extractors
Orchestrators (Dagster, Airflow)
ClickHouse or other analytical databases
Data catalog and governance tools (e.g., OpenMetadata)
Experience with BI environment migrations
Experience with commercial ERP systems

Mid-level Data Engineer

Key skills