Function Health is the AI operating system for health, designed to empower people to live 100 healthy years. They are seeking an experienced Senior Software Engineer, Data Platform, to contribute to the design, development, and optimization of their data infrastructure, ensuring seamless data ingestion, processing, and access across the organization.
Responsibilities:
- Contribute to the design, development, and scaling of core data infrastructure using GCP, Spark, Databricks, and Fivetran
- Develop robust and maintainable ETL/ELT workflows that support diverse structured and unstructured data needs across the organization
- Implement and manage Change Data Capture (CDC) pipelines to enable near real-time data replication and synchronization
- Define and enforce data governance and compliance standards, including access control, auditability, lineage, and metadata management
- Build and manage streaming and batch data pipelines to serve high-impact use cases across analytics, product, compliance, and experimentation
- Act as a strategic partner to cross-functional teams (product, analytics, engineering, clinical) to ensure data is accessible, trustworthy, and impactful
- Drive the long-term architectural vision of the data platform to support current and future business and product needs
Requirements:
- 5+ years of experience in software engineering, with a focus on scalable data architectures
- Strong expertise in GCP (IAM, GCS, Pub/Sub, etc.) and hands-on experience with Spark and Databricks
- Hands-on experience with CDC technologies such as Fivetran or an equivalent
- Proficiency in ETL/ELT tools and frameworks (dbt, Apache Airflow, Dataform, etc.)
- Deep understanding of data governance principles, including compliance and security best practices
- Demonstrated success in collaborating across functions to deliver data solutions for analytics, experimentation, or compliance
- A balance of individual contributor (IC) execution and leadership skills; you're equally comfortable rolling up your sleeves and mentoring others
- Familiarity with streaming data architecture, real-time ingestion, and delivery frameworks
- Proficient in SQL and Python for data processing and automation
- Strong problem-solving skills with the ability to work in a fast-paced environment
- Excellent communication and technical storytelling skills; you can align technical work with business value
- Bias Toward Action: Demonstrated ability to take initiative, make decisions under uncertainty, and move projects forward even in the face of ambiguity
- Entrepreneurial Spirit: Strong adaptability to changing business needs with a knack for building and optimizing processes
- Communication: Excellent communication skills, capable of explaining complex technical concepts to non-technical stakeholders
- Remote Work Adaptability: Comfort with remote work environments, demonstrating the ability to stay productive and connected with the team irrespective of physical location
- Continuous Improvement: A willingness to question assumptions and a commitment to continuous improvement
- Experience with Terraform or Infrastructure-as-Code (IaC) for data infrastructure automation
- Background in HIPAA or other regulated environments with sensitivity to data privacy and compliance
- Familiarity with the dbt Semantic Layer and modern data modeling best practices
- Exposure to data observability platforms and practices
- Familiarity with machine learning data pipelines
- Exposure to multi-cloud or hybrid-cloud environments
- Experience building scalable solutions in a 0-1 environment