SIMARN Solutions is seeking an experienced Data Engineer to support the development of modern cloud-based data platforms. The ideal candidate has expertise in Databricks, Azure data services, and scalable data pipeline development. The role centers on designing and maintaining data pipelines in close collaboration with business stakeholders, data architects, and analytics teams.
Responsibilities:
- Design, develop, and maintain scalable data pipelines using Databricks, PySpark, and Python
- Build and orchestrate workflows using Databricks Workflows and Azure Data Factory
- Develop and optimize config-driven data pipelines for enterprise-scale data processing
- Implement and support Databricks SQL solutions for reporting and analytics
- Create and maintain efficient data models, including star schema and snowflake schema designs
- Work with modern open table formats such as Delta Lake and Apache Iceberg
- Leverage Databricks Genie for AI-driven insights and conversational analytics
- Utilize GitHub Copilot and other AI-assisted development tools to improve engineering productivity
- Support data virtualization strategies and enterprise data integration initiatives
- Collaborate with business stakeholders, data architects, and analytics teams to deliver high-quality solutions
Requirements:
- Strong hands-on experience with:
  - Databricks
  - Azure Data Factory (ADF)
  - Databricks Workflows
  - PySpark
  - Python
  - Databricks SQL
  - Data modeling
- Experience building config-driven ETL/ELT pipelines
- Expertise in star schema and snowflake schema design
- Experience with Delta Lake and/or Apache Iceberg
- Strong understanding of cloud-based data engineering, preferably on Microsoft Azure
- Experience with data virtualization concepts and implementation