Build & Operate Data Pipelines (Batch + Streaming)
Design and implement batch and streaming ingestion from APIs, relational databases, file drops, event streams, and external partners.
Build and optimize ETL/ELT pipelines to produce curated, analytics-ready datasets for reporting and ML consumption.
Implement incremental processing patterns, change data capture (CDC) approaches where appropriate, and data contract standards.
Deliver a Modern Lakehouse (Data Lake / Delta Lake)
Build and manage a scalable lakehouse on AWS object storage (e.g., S3) using open table/file formats and delta/lakehouse concepts (e.g., ACID tables, schema evolution, time travel patterns).
Optimize performance and cost through partitioning, compaction, lifecycle policies, and efficient compute/storage usage.
Establish environment standards for dev/test/prod and consistent promotion across stages.
Must be able to OBTAIN and MAINTAIN a Federal or DoD "PUBLIC TRUST"; candidates must obtain approved adjudication of their PUBLIC TRUST prior to onboarding with Guidehouse. Candidates with an ACTIVE PUBLIC TRUST or SUITABILITY are preferred.
Tech Stack
AWS
Cloud
ETL
Java
Python
Scala
SDLC
SQL
Benefits
Medical, Rx, Dental & Vision Insurance
Personal and Family Sick Time & Company Paid Holidays
Parental Leave
401(k) Retirement Plan
Group Term Life and Travel Assistance
Voluntary Life and AD&D Insurance
Health Savings Account, Health Care & Dependent Care Flexible Spending Accounts
Transit and Parking Commuter Benefits
Short-Term & Long-Term Disability
Tuition Reimbursement, Personal Development, Certifications & Learning Opportunities
Employee Referral Program
Corporate Sponsored Events & Community Outreach
Care.com annual membership
Employee Assistance Program
Supplemental Benefits via Corestream (Critical Care, Hospital Indemnity, Accident Insurance, Legal Assistance and ID theft protection, etc.)