Role Overview

Architect and build production data pipelines and data platforms that serve models, data, and AI workflows to internal and client-facing applications — and stay accountable for them under live conditions.
Own non-functional quality across your domain: latency and throughput budgets, scalability, reliability, observability, and cost.
Lead the design and operation of multi-model data stores — relational (PostgreSQL, MySQL), vector (Pinecone, Weaviate, pgvector), and graph (Neo4j, Neptune) — applying the right tool to each access pattern, not the most fashionable one.
Set technical direction: write design docs, make build-vs-buy calls, and defend your approach with evidence rather than instinct.
Work across the stack when the problem demands it — services, data access, infrastructure-as-code, CI/CD — and diagnose it when things drift in production.
Raise the floor for the team: mentor mid-level and junior engineers, run rigorous code reviews, and hold the quality bar without making it someone else's job to ask you.

Requirements

5+ years of professional experience shipping and operating production data systems — you've lived through scaling challenges, reliability incidents, and the unglamorous gap between a working prototype and a dependable service.
Deep, demonstrable expertise designing distributed data pipelines with Apache Spark, and strong data modelling instincts across relational, vector, and graph databases.
Strong proficiency in at least one general-purpose language (Python, Scala, or Java), and the ability to work effectively across others when needed.
Hands-on experience with cloud platforms (GCP or AWS), containers (Docker), CI/CD, and infrastructure-as-code (Terraform).
The engineering discipline to maintain what you build — version control, testing, code review, and a low tolerance for technical debt you created.
Comfort with ambiguity. Many of our problems don't arrive with a known-good solution attached.
Communication that scales: you can write a one-page design doc that's useful to both a product stakeholder and a staff engineer without two separate documents.
Bonus Points for:
Experience building and operating ML/LLM-powered production systems at scale — model serving, RAG, agents.
Hands-on work with event-driven or streaming architectures (Pub/Sub, Kafka) and real-time systems.
Depth in security, IAM, networking, or data governance in cloud environments.
Background in marketing technology, ad tech, or large-scale data products.
Meaningful open-source contributions or a visible track record of technical leadership outside your day job.

Tech Stack

Apache
AWS
Cloud
Docker
Google Cloud Platform
Java
Kafka
MySQL
Neo4j
Postgres
Python
Scala
Spark
Terraform

Benefits

Competitive salary and hybrid work model – come hang out in our Athens office or work remotely from anywhere in European economic Area (EU, Switzerland etc.) or UK (up to 6 weeks per year).
Training budget to level up your skills from the top tech partners in the market (Microsoft, AWS, Salesforce, Databricks etc.) – whether it’s certifications or courses, we’ve got you covered.
Private insurance, top-tier tech gear, and the chance to work with a stellar crew.

Senior Data Engineer

Key skills

About this role

Role Overview

Requirements

Tech Stack

Benefits