NPS Prism is a market-leading, cloud-based CX benchmarking and operational improvement platform owned by Bain & Company. We are seeking a highly skilled and experienced Data Engineer to design and implement robust data solutions for scalable business needs, utilizing cloud platforms and big data technologies.

Responsibilities:

Design, build, and optimize ETL/ELT workflows using tools like Databricks, SQL, Python/pyspark & Alteryx (Good to have)
Develop and maintain robust, scalable, and efficient data pipelines for processing large datasets. from source to emerging data
Work on cloud platforms (Azure, AWS) to build and manage data lakes, data warehouses, and scalable data architectures
Utilize cloud services like Azure Data Factory, AWS Glue, or for data processing and orchestration
Use Databricks for big data processing, analytics, and real-time data processing
Leverage Apache Spark for distributed computing and handling complex data transformations
Create and manage SQL-based data solutions, ensuring high availability, scalability, and performance
Develop and enforce data quality checks and validation mechanisms
Collaborate with cross-functional teams, including data scientists, analysts, and business stakeholders, to deliver impactful data solutions
Understand business requirements and translate them into technical solutions
Leverage CI/CD pipelines to streamline development, testing, and deployment of data engineering workflows
Work with DevOps tools like Git, Jenkins, or Azure DevOps for version control and automation
Maintain clear documentation for data workflows, pipelines, and processes
Optimize data systems for performance, scalability, and cost-efficiency

Requirements:

Bachelor's or Master's degree in Computer Science, Information Technology, Engineering, or a related field
3–6 years of experience in Data Engineering or related roles
Hands-on experience with big data processing frameworks, data lakes, and cloud-native services
Proficiency in Python, SQL, and PySpark for data processing and manipulation
Proven experience in Databricks and Apache Spark
Expertise in working with cloud platforms like Azure, AWS
Sound knowledge of ETL processes and tools like Alteryx
Leveraging data lakes, data warehouses, and data pipelines
Build a Data Pipeline from scratch
Strong understanding of distributed systems and big data technologies
Basic understanding of DevOps principles and familiarity with CI/CD pipelines
Hands-on experience with tools like Git, Jenkins, or Azure DevOps
Strong problem-solving skills and a knack for optimizing data solutions
Excellent communication (oral and written) skills
Familiarity with data visualization tools like Power BI, Tableau, or similar
Knowledge of streaming technologies such as Kafka or Event Hubs

Data Engineer II

Key skills

About this role

Responsibilities:

Requirements: