Cortica is a rapidly growing healthcare company pioneering the most effective treatment methods for children with neurodevelopmental differences. The Senior AI Data Engineer will serve as both architect and builder of the data ecosystem, working closely with various teams to design intelligent, automated data solutions that enhance operational efficiency and care decisions.
Responsibilities:
- Engage stakeholders directly to gather, clarify, and document project requirements
- Translate requirements into architected data solutions: choose the right storage, pipeline, modeling, and delivery approach for each problem
- Own testing end-to-end — unit tests, data quality checks, reconciliation, and integration tests before anything reaches production
- Deploy solutions to production and monitor post-deployment health, iterating rapidly based on real-world feedback
- Run parallel AI coding sessions (Claude Code, Cursor, Codex) across different facets of a pipeline simultaneously — orchestrate, verify, and integrate the outputs
- Build and maintain context files (CLAUDE.md equivalents) for data projects that encode schema conventions, pipeline patterns, and institutional knowledge — making every future AI session smarter
- Design verification loops: automated data quality checks, dbt tests, CI hooks, and pipeline monitors that give AI agents concrete feedback on correctness
- Build MCP (Model Context Protocol) or equivalent integrations to connect AI agents directly to Snowflake, Amazon Athena, Postgresql, MySql, Power BI APIs, Salesforce, and internal tooling
- Prefer frontier models for complex architectural decisions and rely on AI acceleration to dramatically increase engineering throughput
- Design and build complex, reliable data pipelines ingesting from AWS, Azure, Salesforce, MuleSoft, and multiple third-party APIs into our AWS Data Lake and Snowflake warehouse
- Implement and evolve data models using Kimball methodology to support financial, operational, and clinical analytics
- Optimize pipeline performance, manage data quality, and perform root-cause analysis on data anomalies — internal and external
- Develop and maintain orchestration workflows in Python, and AWS Glue
- Continuously evolve the data schema as business and engineering requirements change
- Build and support Power BI data models and reports; empower analytics team members to self-serve on a reliable data foundation
- Work with data analysts and data scientists to build reusable, well-documented pipeline components they can extend independently
- Deliver data products that drive clinical care decisions, financial planning, and operational performance improvements
- Build lightweight internal data applications and tooling where needed; data entry interfaces, operational dashboards, automation scripts that bridge the gap between data pipelines and end users
- Design for agentic workflows: build AI-powered data tools accessible via web interfaces or Slack that surface insights proactively
- Integrate with Salesforce Health Cloud and other platforms using APIs and event-driven patterns
- Ensure data security and HIPAA compliance in all pipeline and application work. Partner with IT to enforce data governance standards
- Document decisions, tradeoffs, and architecture clearly so that future engineers (and AI agents) can build on your work effectively
- Collaborate across IT, finance, clinical operations, and data science — acting as the connective tissue between data infrastructure and business outcomes
Requirements:
- 5+ years of hands-on data engineering experience, including building and operating production data pipelines
- Expert-level Python skills for ETL, pipeline orchestration, and automation
- Deep SQL proficiency — query optimization, data modeling, stored procedures
- 2+ years' experience working with AI first development workflows
- 4+ years' experience with the following AWS (S3, Glue, Lambda, Redshift), and/or Azure big data services
- 1+ year of experience with Snowflake
- 2+ years of experience with orchestration frameworks
- 2+ years of Salesforce experience with Apex and configurations
- Experienced with Kimball dimensional modeling — you've built star schemas and conformed dimensions in production
- Power BI (or equivalent BI tool) experience — data model design and report development
- API integration experience — REST, GraphQL, event streaming (Kafka, Kinesis, or similar)
- Application development literacy — comfortable building lightweight web tooling (Python/Flask, Node, or similar) to complement data products
- Reside in one of the following states: CA, TX, NC, WA, ID, NV, AZ, CO, KS, AR, LA, AL, GA, FL, SC, TN, VA, MD, NJ, DE, IL, WI, MI, OH, MA, PA, NH, CT