Providence is a comprehensive health care organization that serves over 50 hospitals and 1,000 clinics across multiple states. They are seeking a Senior Data Engineer to analyze clinical data, build data-centric software applications, and mentor less experienced Data Engineers while collaborating with various teams to enhance healthcare processes.
Responsibilities:
- Apply knowledge of health care and database management to analyze clinical data, and to identify and report trends
- Design and build modern data-centric software applications to support clinical and operational processes across all parts of the healthcare system
- Build data pipelines and transformations, data enrichment processes, provisioning layers, and user interfaces to meet the requirements of key initiatives
- Mentor and assist less experienced Data Engineers and participate in research and development of new technologies to help set up repeatable processes or templates for other Data Engineers to follow
- Prioritize and encourage collaboration, supported by meticulous source control and documentation
- Emphasize simple solutions to complex problems using modern and emerging methods and tools
- Work closely with the Product, Platform, and Architecture teams to deliver on joint efforts
Requirements:
- Bachelor's degree in Computer Engineering, Computer Science, Mathematics, or Engineering
- 5 or more years of experience as a Senior Data Engineer
- Master's degree in Computer Engineering, Computer Science, Mathematics, or Engineering
- 8 years of data development experience
- Advanced Hadoop / Spark / NoSQL (Spark, Hadoop ecosystem, Kafka, Flink, NoSQL platforms)
- Advanced SQL + Python (Python, SQL, Linux-based processing)
- OLAP / Cloud Data Warehousing (Snowflake, Synapse, Redshift, BigQuery, dimensional models)
- Unstructured & Multimodal Data Engineering (EHR/EMR, HL7, FHIR, APIs, clinical notes)
- Modern Application Development Frameworks (FastAPI, Flask)
- Modern ETL / ELT Tooling (ADF, Glue, Airflow, dbt, NiFi, Informatica, Fivetran)
- AI/ML + LLM / Prompt Engineering (LLM pipelines, RAG, LangChain/LlamaIndex, embeddings, vector DBs)