Development of complex SQL queries for data extraction, manipulation, and reporting and transformation technologies
Design and implement robust ETL/ELT pipelines using custom-tooling (Python/Google) and off-the shelf tooling with focus on monitoring, supportability, and resource stewardship.
Contribute to and leverage coding standards and best practices to ensure efficient and re-usable services and components.
Architect, implement and deploy new data models and data processes in production
Builds data pipelines which acquire, cleanse, transform and publish data from a wide variety of sources.
Assembles large, complex data sets which meets functional and non-functional business requirements.
Partners with data asset managers, architects, and development leads to ensure technical solution provides data which is fit for use and in line with architecture blueprints.
Identify, document, design, and implement internal process improvements.
Other duties as assigned by management.
Requirements
Bachelor’s degree required in Data Science, Computer Science or MIS, Mathematics, Engineering, or related field.
5+ years of prior experience in Data Management / ETL / ELT / Data Warehousing.
Experience in writing Data Quality routines for cleansing of data and capturing confidence score and master data management (MDM).
Hands-on experience with designing and implementing data pipelines and ELT/ETL processes (ex. Fivetran, DBT).
Hands-on experience with cloud platforms (ex.GCP, AWS, Azure) and related services (ex. BigQuery, S3, Snowflake, etc.).
Strong understanding of data modeling, data integration, and data governance principles (ex. DBT).
Experience working in a highly regulated domain (ex. Healthcare, Banking, etc.
Strong knowledge of Structured Query Language (SQL) and Transact-SQL (T-SQL).
Experience using scripting languages such as JavaScript or Python.
Experience with agile delivery methodologies.
Strong organizational, administrative, and analytical skills required.