Design and develop scalable data processing pipelines using distributed computing frameworks to handle large-scale enterprise datasets
Build and maintain ETL processes that extract, transform, and load data into strategic information products supporting organizational goals
Develop data-intensive applications and services using modern programming languages and cloud-based analytics platforms
Implement stream processing solutions using real-time data processing technologies to support business-critical operations
Collaborate with cross-functional teams to translate complex business requirements into robust technical solutions
Provide production support and troubleshooting for data systems, ensuring high availability and performance
Requirements
6+ years of hands-on experience in technology application development and production support
5+ years of experience developing data pipelines that extract, transform, and load data using programming languages such as Python, Scala, or Java
3+ years of experience developing and supporting ETL processes using cloud-based analytics platforms (such as Databricks, Snowflake, or Azure Synapse)
Experience building data-intensive applications using modern programming technologies (including but not limited to C#, Java, Python, Scala, and SQL)
Proficiency with cloud computing environments (such as AWS, Azure, or Google Cloud Platform)
Experience with stream processing technologies (such as Apache Kafka, Apache Pulsar, or Amazon Kinesis) for real-time data processing (preferred)
Hands-on experience with distributed processing frameworks (such as Apache Spark, Dask, or Hadoop MapReduce) for large-scale data analytics (preferred)
Knowledge of data integration tools (such as Apache NiFi, Talend, or Informatica) and database replication technologies (preferred)
Experience with CI/CD pipelines, containerization technologies (such as Docker, Podman, or containerd), and orchestration platforms (such as Kubernetes, Docker Swarm, or OpenShift) (preferred)
Tech Stack
Apache
AWS
Azure
Cloud
Docker
ETL
Google Cloud Platform
Hadoop
Informatica
Java
Kafka
Kubernetes
MapReduce
OpenShift
Pulsar
Python
Scala
Spark
SQL
Benefits
Health & Wellness: Health care coverage designed for the mind and body.
Flexible Downtime: Generous time off helps keep you energized for your time on.
Continuous Learning: Access a wealth of resources to grow your career and learn valuable new skills.
Invest in Your Future: Secure your financial future through competitive pay, retirement planning, a continuing education program with a company-matched student loan contribution, and financial wellness programs.
Family Friendly Perks: It’s not just about you. S&P Global has perks for your partners and little ones, too, with some best-in-class benefits for families.
Beyond the Basics: From retail discounts to referral incentive awards, small perks can make a big difference.