Spotify is a leading music streaming service, and they are seeking a Data Engineer to join their Personalization team. The role involves designing and maintaining data pipelines to enhance user experiences for Spotify Wrapped, collaborating with cross-functional teams to transform listening behavior into personalized data stories for millions of users.
Responsibilities:
- Design, build, and maintain distributed data pipelines that power Spotify Wrapped data stories and personalized experiences for more than 300M users globally
- Partner with Data Scientists to evaluate and operationalize new Wrapped story concepts, balancing personalization, scalability, and eligibility requirements
- Build scalable systems that process large-scale listening data and generate insights that celebrate users’ unique listening journeys
- Develop and optimize pipelines supporting AI-powered personalized playlist experiences and recommendation technologies
- Collaborate with partner teams to integrate social and shared listening experiences into Wrapped and adjacent user experiences
- Contribute to technical excellence by improving reliability, observability, performance, and development velocity across the squad’s data systems
- Support experimentation and iteration on new storytelling concepts beginning early in the product development cycle
- Work cross-functionally with engineering, product, design, and music domain experts to bring large-scale personalized experiences to life
Requirements:
- Experience of working in a product-driven engineering environment
- You have experience working with high-volume, heterogeneous datasets using distributed systems and big data technologies such as Python, Scala, Scio, Ray, Apache Spark, or similar frameworks
- You are proficient in designing and building distributed data pipelines in Python, Scala, or Java, including experience with frameworks such as Scio and platforms like Dataflow
- You understand data modeling, data access, and storage techniques across both batch and analytical processing systems
- You have experience working with large-scale analytical systems such as BigQuery or similar technologies
- You value iterative software development, data-driven decision making, reliability, responsible experimentation, and cost-efficient engineering practices
- You thrive in collaborative environments and enjoy working closely with cross-functional teams across engineering, data science, and product
- You are a creative problem solver who enjoys building products that create meaningful experiences for millions of users
- You are excited by the challenge of turning research ideas and experimental concepts into reliable, scalable production systems