Amazon RedshiftAWSCloudEC2MySQLPostgresPythonSQLMachine LearningData EngineeringAnalyticsRedshiftS3RDSGlueProject ManagementRemote Work
About this role
Role Overview
Supervise junior members of the data engineering team.
Guiding, planning, and reviewing the team's work
Create and maintain optimal data pipeline architecture
Assemble large, complex data sets that meet functional / non-functional business requirements
Extend our machine learning platform by designing tools that interface with cloud services, our current code base, and provide new flexibility in model building
Build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources using SQL, Python, and AWS
Build analytics tools to provide actionable insights into key business performance metrics, as well as supporting the needs of the analytics team
Create data-handling tools for analytics and data scientist team members that assist them in building and optimizing our decision-making process
Requirements
5+ years of experience in a Data Engineer role
Bachelor's degree in Computer Science, Statistics, Informatics, Information Systems or another quantitative field
Advanced working SQL knowledge and experience working with relational databases (including Postgres and MySQL), query authoring (SQL), as well as working familiarity with a variety of databases
Experience building data pipelines, architectures, and data sets from raw, loosely structured data
A history of focusing on test driven design and results for repeatable and maintainable processes and tools
Experience building processes supporting data transformation, data structures, metadata, dependency, and workload management
Working knowledge of message queuing, stream processing, and highly scalable data stores
Strong project management and organizational skills and the ability to work independently in a fast-paced, quickly changing environment
Ability to keep up with several projects at once and understand the impact of projects within a larger system
Experience supporting and working with cross-functional teams in a dynamic environment
Experience managing junior engineers and guiding a team of engineers through project planning, execution, and quality control stages
Candidate should have experience using the following software/tools:
Experience with object-oriented design in Python
Experience with data pipeline and workflow management tools
Experience with AWS cloud services: EC2, RDS, Redshift, Glue, S3