Sourcebooks is an innovative publishing powerhouse that believes in the transformative power of books. They are seeking a contract data engineer to help scale their data platform by building and extending infrastructure that supports informed decision-making within the organization.
Responsibilities:
- Expand silver and gold table coverage by converting raw data into business-ready tables, incorporating business logic from analytics stakeholders
- Maintain and debug pipelines in Microsoft Fabric and Azure Data Factory
- Validate incremental load logic
- Investigate data issues
Requirements:
- 3-5 years of experience in analytics or, ideally, data engineering
- Strong SQL skills: comfortable with multi-CTE queries and with debugging code a team member wrote
- Hands-on experience with a cloud data platform (Fabric, Databricks, or similar), plus Python/PySpark as a secondary skill, at a level sufficient to read and modify existing notebooks
- Experience working within established systems, following Spark SQL conventions, and communicating blockers clearly
- Experience with Microsoft Fabric
- Familiarity with Delta Lake and medallion architecture
- Experience with financial transaction data (SAP, NetSuite)
- Azure CLI or REST API experience for platform-level queries
- Exposure to agentic AI workflows or AI-assisted development for debugging