Build and maintain the shared data access library and SDKs that Platform, Packaging, and Dataset API teams use to read from and write to multiple data sources
Design and implement event-driven data flows using event brokers, CDC connectors, a schema registry, event routing, and dead-letter queues
Build the systems that track how data moves through the platform (lineage), enforce who can access what (governance and RBAC), and log what happened (auditing)
Instrument the data platform with OpenTelemetry, and define and monitor SLOs for query latency and pipeline success rates
Contribute to infrastructure cost visibility and optimization
Requirements
4+ years building platform infrastructure, data infrastructure, data platforms, or backend systems with significant data components
Strong proficiency in Python
Hands-on experience with SQL and at least two of: Snowflake, Redshift, Postgres
Experience with AWS — S3, RDS, EKS, EventBridge, IAM
Experience with Kubernetes
Familiarity with data orchestration tools (Prefect, Airflow, or Dagster) and transformation frameworks (dbt)
Understanding of data governance concepts — RBAC, PII handling, audit logging, data lineage
Fluency with AI-assisted development tools (Claude Code, Cursor, or similar)