Design and own the ingestion architecture that moves data from 20+ enterprise source systems into the AWS Marketing Data Layer
Define scalable ingestion patterns for both legacy and direct source integrations across 18 business and data domains
Architect and document data contracts between source systems and the data layer, including schema standards and versioning, SLAs and freshness expectations, data quality and governance requirements
Design and optimize AWS S3 data lake structures, including raw, staged, and curated zones, partitioning and storage strategies, Lake Formation access controls and governance policies
Define Change Data Capture (CDC) and streaming ingestion patterns to support near-real-time data freshness requirements
Translate conceptual and logical architecture into detailed, implementable technical specifications for distributed and nearshore Data Engineering teams
Partner with engineering leadership and client stakeholders to ensure scalability, performance, maintainability, and operational excellence across the platform
Provide technical leadership and architectural guidance throughout the delivery lifecycle
Requirements
10+ years of experience in data engineering, data platform architecture, or related disciplines
3+ years serving in a Data Architect or Solution Architect capacity
Strong experience with AWS data services including AWS Glue, Amazon S3, AWS Lake Formation, Amazon Athena
Deep expertise in modern data lake architecture and large-scale data ingestion frameworks
Experience designing CDC and streaming-based ingestion patterns
Strong understanding of data contracts, schema management, and data governance principles
Advanced experience with Python and PySpark
Strong knowledge of distributed data processing and cloud-native architecture patterns
Ability to create clear architectural documentation and implementation specifications for engineering teams