HMG AMERICA LLC is seeking a highly skilled AWS Data Engineer with strong experience in PySpark and AWS-native data architectures. The role involves building and optimizing scalable ETL/data pipelines and designing AWS-based data engineering solutions to handle large-scale datasets.

Responsibilities:

Build and optimize scalable ETL/data pipelines using PySpark
Design AWS-based data engineering solutions using EMR, Glue, S3, Iceberg, etc
Improve performance and scalability for large-scale data processing systems
Collaborate with architects, analysts, and engineering teams on data initiatives
Ensure reliability, monitoring, and best practices across data platforms

Requirements:

Strong hands-on experience with PySpark
Strong hands-on experience with Apache Iceberg
Experience with EMR + Glue
Deep understanding of AWS native architecture
Experience designing scalable and performant applications/data pipelines
Knowledge of data lake and distributed processing concepts
Experience handling large-scale datasets and optimization techniques
Exposure to Agentic Workflows
Experience with modern AI/data engineering ecosystems

AWS Data Engineer

Key skills

About this role

Responsibilities:

Requirements: