Noctua Technology is a software engineering and consulting corporation focused on data engineering, machine learning, and cloud technologies. They are seeking a talented Data Engineer to build and optimize data pipelines, implement storage solutions, and support customers' data-driven initiatives.
Responsibilities:
- Acquire, clean, and preprocess diverse datasets from various sources
- Build the infrastructure required for optimal extraction, transformation, and loading of data from various sources using cloud service provider (CSP) managed services and SQL technologies
- Develop and maintain data pipelines to ensure a continuous flow of high-quality data
- Develop data migration strategies and schemas to lead customer migrations from on-prem to cloud technologies
- Perform data migration activities
- Optimize databases and data warehouses for efficient querying and data storage
- Perform exploratory data analysis to uncover patterns, trends, and insights
- Create visualizations and reports to communicate findings effectively to stakeholders both internally and externally
- Collaborate with cross-functional teams, including software engineers, domain experts, and business analysts, to understand requirements and deliver integrated solutions
- Create and maintain comprehensive documentation for code, algorithms, and models, and ensure that knowledge is shared and accessible within the team
- Act on client feedback constructively to improve services and outcomes
- Continuously seek ways to enhance the overall customer experience
- Stay updated on the latest developments in machine learning, data science, and analytics
- Drive innovation by proposing and implementing new techniques and technologies
Requirements:
- Solid understanding of, and experience with, SQL and relational database concepts
- Solid understanding of database technologies, data warehouses, and ETL tools (e.g., MySQL, PostgreSQL, Beam, Airflow, and Kafka)
- Experience with data analysis tools (e.g., Jupyter, Colab, Pandas)
- Experience with data visualization tools (e.g., Tableau, Looker, Power BI, Qlik, and Superset)
- Previous experience developing data strategies and facilitating data migrations into production systems
- Experience with cloud platforms (e.g., AWS, Azure, GCP)
- Proficiency in programming languages such as Python, Java, or C++
- Strong software engineering skills with an emphasis on writing clean, modular, and maintainable code
- Familiarity with version control systems (e.g., Git) and collaborative development workflows
- Excellent problem-solving and critical-thinking skills
- Effective communication skills and ability to work in a collaborative team environment
- Bachelor's or advanced degree in Computer Science, Data Science, Machine Learning, or a related field
- Experience with other database technologies (e.g., NoSQL, Graph)
- Google Cloud Professional Cloud Architect certification
- Google Cloud Professional Database Engineer certification
- Google Cloud Professional Data Engineer certification
- Experience with additional data processing tools and technologies (e.g., Spark, Hadoop)
- Knowledge of containerization and orchestration tools (e.g., Docker, Kubernetes)