HMG AMERICA LLC is seeking a highly skilled AWS Data Engineer with strong experience in PySpark and AWS-native data architectures. The role involves building and optimizing scalable ETL/data pipelines and designing AWS-based data engineering solutions to handle large-scale datasets.
Responsibilities:
- Build and optimize scalable ETL/data pipelines using PySpark
- Design AWS-based data engineering solutions using EMR, Glue, S3, Iceberg, etc
- Improve performance and scalability for large-scale data processing systems
- Collaborate with architects, analysts, and engineering teams on data initiatives
- Ensure reliability, monitoring, and best practices across data platforms
Requirements:
- Strong hands-on experience with PySpark
- Strong hands-on experience with Apache Iceberg
- Experience with EMR + Glue
- Deep understanding of AWS native architecture
- Experience designing scalable and performant applications/data pipelines
- Knowledge of data lake and distributed processing concepts
- Experience handling large-scale datasets and optimization techniques
- Exposure to Agentic Workflows
- Experience with modern AI/data engineering ecosystems