Pluribus Digital is a digital services consultancy that partners with government customers to improve public services. The Data Engineer will build scalable data platforms and pipelines, collaborating with various stakeholders to transform complex data into actionable insights while ensuring data quality and performance.
Responsibilities:
- Design, develop, and maintain scalable data pipelines and ETL/ELT workflows
- Build and optimize data models, warehouses, and data lakes to support analytics and operational applications
- Develop batch and real-time data processing solutions using modern cloud platforms
- Collaborate with software engineers to integrate data services into customer-facing applications
- Ensure data quality, integrity, governance, and observability across platforms
- Optimize database performance, storage, and query efficiency
- Implement automated testing, monitoring, and deployment pipelines for data infrastructure
- Work with product teams and client stakeholders to understand business requirements and translate them into technical solutions
- Troubleshoot production issues and continuously improve system reliability
- Contribute to architecture decisions and engineering best practices
- Participate in Agile ceremonies including sprint planning, standups, retrospectives, and backlog refinement
- Mentor junior engineers and contribute to a collaborative engineering culture
Requirements:
- 3+ years of professional experience building and maintaining data pipelines and data platforms
- Experience developing ETL/ELT processes using Python, SQL, or similar programming languages
- Experience working with relational and non-relational databases
- Experience with cloud platforms such as AWS, Azure, or Google Cloud Platform
- Strong SQL skills and experience optimizing complex queries
- Experience with data modeling and schema design
- Familiarity with distributed data processing frameworks such as Apache Spark
- Experience using Git and CI/CD practices
- Understanding of data governance, security, and privacy best practices
- Strong communication and collaboration skills
- Experience working within Agile software development teams
- Experience with AWS data services such as Glue, Athena, Redshift, EMR, Lambda, S3, and Lake Formation
- Experience with modern orchestration tools such as Apache Airflow or Dagster
- Experience with streaming technologies such as Kafka or Kinesis
- Experience building data warehouses and lakehouse architectures
- Familiarity with Infrastructure as Code (Terraform or CloudFormation)
- Experience supporting federal government programs
- Experience working within digital services or consulting environments
- Active Public Trust or ability to obtain one