BIG EFFECT® is seeking a Full Stack Data Engineer to design, build, and maintain scalable data pipelines and architecture. The role involves developing APIs, managing data quality, and overseeing infrastructure and DevOps practices.

Responsibilities:

Pipeline design & orchestration
Design, build, and maintain scalable batch and streaming data pipelines. Orchestrate workflows, ensuring fault-tolerance, observability, and automated recovery
Data warehouse & lake architecture
Model and manage data in cloud data warehouses and data lake environments. Own schema design, partitioning strategies, and incremental loading patterns to optimize cost and query performance
API & backend data services
Develop and maintain RESTful or GraphQL APIs that serve processed data to internal and external consumers. Design low-latency serving layers and caching strategies to support high-traffic product features
Frontend data tooling & dashboards
Build internal and customer facing data tools, dashboards, and self-serve analytics interfaces. Partner with analysts, data scientists and product engineers to translate data needs into intuitive user experiences
Data quality & governance
Implement data quality frameworks, testing strategies, and monitoring alerting across pipelines. Contribute to data cataloging, lineage tracking, and access control policies in line with compliance requirements
Infrastructure & DevOps
Manage data infrastructure. Own CI/CD for data pipelines, containerize workloads with Docker and Kubernetes, and help define infrastructure standards for the data platform team

Requirements:

5+ years of software or data engineering experience with production systems at scale
Strong proficiency in Python and SQL; comfort with Scala or Java a plus
Hands-on experience with a cloud data platforms (Supabase, Snowflake, AWS/GCP/AZure)
Expertise in building and operating ETL/ELT pipelines in batch and/or streaming contexts
Expertise in data transformation and modeling
Expertise in developing APIs or microservices (REST, GraphQL, FastAPI, or similar)
Comfortable with containerization (Docker, Kubernetes) and CI/CD practices
Ability to build data dashboards or internal tools using React, Streamlit, Metabase, or similar
Familiarity with data visualization libraries (D3.js, Vega-Altair, Plotly)
Hands-on experience with graph databases (e.g. Neo4j, Amazon Neptune, or ArangoDB) and graph query languages such as Cypher or Gremlin; ability to model complex entity relationships as graphs and run traversal-based analytics at scale
Prior experience in music streaming data, artist and track metadata, audio feature datasets, listener behavior data, or recommendation signal pipelines at consumer scale
Experience ingesting and processing social media data — including engagement signals, user-generated content, follower graphs, and platform API integrations (e.g., Meta, TikTok, X/Twitter, YouTube)
Familiarity with marketing and advertising data domains — including campaign attribution, conversion funnels, audience segmentation, impression and click-through data, and integration with ad platforms such as Google Ads, Meta Ads Manager, or DSPs
Experience with privacy-safe data practices relevant to advertising (e.g., differential privacy, clean rooms, cookieless attribution)
Experience with ML pipelines, feature stores, or MLOps tooling

Full Stack Data Engineer

Key skills

About this role

Responsibilities:

Requirements: