Support and enhance the data engineering components of the platform, ensuring availability, performance, scalability, and reliability of processing environments.
Monitor processes, resolve incidents, implement continuous improvements, and develop new features.
Collaborate with architecture, development, and business teams to ensure data quality and governance.
Monitor and ensure availability of data processing environments.
Identify, analyze, and resolve incidents in production environments.
Implement improvements to optimize performance, scalability, and reliability of data pipelines.
Investigate inconsistencies, validate data quality, and implement corrective and preventive actions.
Develop and maintain processing pipelines using Azure Databricks (PySpark and Spark SQL).
Create and maintain ingestion and orchestration workflows using Azure Data Factory.
Document technical solutions, processes, flows, and operational procedures.
Contribute to the evolution of architecture and best practices in data engineering.
Requirements
Experience with Microsoft Azure.
Experience with Azure Databricks (PySpark and Spark SQL).
Experience with Azure Data Factory (ADF).
Experience with Azure Synapse Analytics.
Proficiency in Python.
Experience with PySpark.
Strong SQL skills (Spark SQL and T-SQL).
Experience developing and supporting data pipelines.
Experience monitoring and resolving incidents in production environments.
Knowledge of data quality, integrity, and governance.
Familiarity with data architecture best practices in the cloud.
Experience optimizing performance of data pipelines.
Experience with distributed processing using Apache Spark.
Code versioning (Git/Azure DevOps).
Understanding of Data Lake, Data Warehouse and ETL/ELT concepts.
Experience with monitoring and observability of data environments.
Experience with agile methodologies (Scrum/Kanban).
Experience working in critical production environments.
Bachelor's degree in Computer Science, Computer Engineering, Information Systems, Software Engineering, Systems Analysis and Development, or related fields.
Postgraduate degree or certifications in Data Engineering, Cloud Computing, or Microsoft Azure is a plus.
Tech Stack
Apache
Azure
Cloud
ETL
PySpark
Python
Spark
SQL
Benefits
Company-subsidized Health Insurance for the employee.
Option to include dependents in the Health Insurance with payroll deduction.
Dental care (optional).
Option to include dependents in Dental Care with payroll deduction.
Meal allowance or Food voucher.
Transportation voucher (optional).
Impact & Care
a personal guidance program offering confidential emotional support and counseling in psychological, legal, financial, social and pet-related areas at no cost for the employee and legal dependents.
Gympass
Wellhub (Access to over 700 gyms throughout Brazil with plans starting at R$ 29.90 deducted via payroll).
Option to include dependents in Gympass
Wellhub (up to 3 dependents
paid via credit card).
Access to Udemy via our intranet.
Partnerships with major consumer brands.
Agreement with SESC for employee and dependents.
Discounts with educational institutions (undergraduate and postgraduate) and language/certification schools.