Programmers.io is a company focused on data engineering and site reliability engineering. They are seeking a Capacity Engineer with expertise in capacity planning to ensure infrastructure scales efficiently to meet user demand while balancing performance and cost.
Responsibilities:
- Design and maintain ETL/ELT pipelines to collect, transform, and store infrastructure usage data
- Build models to analyze system metrics and predict future resource needs
- Analyze historical usage patterns to predict CPU, memory, and storage requirements
- Simulate traffic spikes to identify bottlenecks and ensure systems scale linearly
- Optimize resource allocation to avoid unnecessary costs while maintaining service availability
- Use Infrastructure as Code (IaC) tools like Terraform to automate scaling and provisioning
- Collaborate with software teams to flag single points of failure and ensure resilient service design
Requirements:
- Strong background in data engineering and SRE practices
- Hands-on experience with capacity planning, forecasting, and scaling
- Proficiency in IaC tools (Terraform, Ansible, Harness)
- Experience with data pipelines, ETL/ELT frameworks, and big data tools
- Familiarity with monitoring/observability platforms (Prometheus, Grafana, Datadog)
- Knowledge of chaos engineering and resilience testing
- Excellent collaboration and communication skills