Harnham is seeking a senior-level Data Engineer to join a high-performing, multi-disciplinary engineering team building modern, enterprise-grade data platforms. This role involves defining data strategy, owning platform decisions, and driving production outcomes across complex systems while modernizing data platforms that support AI-driven products.
Responsibilities:
- Define and lead data architecture across data lakes, warehouses, and pipeline
- Design, build, and optimise scalable batch and real-time data pipeline
- Establish and enforce data governance, quality, and compliance standard
- Implement monitoring, logging, alerting, and CI/CD workflows for data system
- Drive data platform modernisation with a focus on performance, cost, and scalability
- Design data contracts and event-driven integrations with backend and platform team
- Build production data pipelines supporting AI/ML systems, including embeddings, vector stores, RAG workflows, feature stores, and training/inference data flow
- Partner with engineering and leadership teams to translate data strategy into execution
- Mentor junior and mid-level engineers and raise overall data engineering standard
Requirements:
- 7+ years of professional experience in data engineering, including ownership of complex data platforms
- Strong background in distributed systems and data architecture
- Expert-level proficiency in Python, SQL, and either Scala or equivalent JVM-based languages
- Extensive experience with cloud-native data platforms and enterprise data warehousing
- Hands-on experience with streaming and real-time data systems (e.g., Kafka, Kinesis, Pub/Sub)
- Proven experience building data pipelines for production AI/ML use cases