Ramp is building the smart infrastructure for finance teams, embedded in the transaction flow of every dollar a business spends. The Production Engineer role focuses on building and operating critical infrastructure, driving architectural changes, and enabling AI-native engineering to support Ramp's growth and reliability.
Responsibilities:
- Build and operate critical infrastructure across Ramp's compute, storage, messaging, and observability stack — owning the systems that handle real financial transactions at scale
- Drive architectural change — not just flag problems. When you surface a reliability or scalability issue, you own the path forward: you propose the solution, find the owners across engineering, and stay in until it's resolved
- Partner with product teams at the design phase — reviewing architectures, embedding golden paths, and making it easy to build correctly the first time
- Build Ramp's next level of scale — you'll be a hands-on contributor to the most consequential infrastructure shift happening right now: our move to a cellular architecture, enabling Ramp to scale, reach international markets, operate in highly regulated and constrained environments (e.g. FedRAMP), and deliver on enterprise-grade SLAs
- Enable AI-native engineering — as Ramp builds increasingly AI-powered products, PE is the team that makes sure the platform can support them. You'll proactively partner with product teams on AI infrastructure patterns, define the golden paths that turn one-off solutions into reusable foundations, and stay ahead of emerging challenges before they become blockers
- Build developer tooling and self-service infrastructure — so that other teams can answer their own questions (cost, performance, reliability) without involving PE
- Participate in on-call rotation — and more importantly, use every incident as a signal to eliminate the root cause, not just resolve the symptom
- Lead across the company — PE doesn't just review designs or show up when called. We proactively initiate cross-team architectural reviews, and we take full ownership of company-wide reliability and scalability initiatives: identifying the problem, proposing the solution, aligning the stakeholders, and staying in until it's done
Requirements:
- 2+ years of software engineering experience shipping high-quality architectures for critical systems
- Strong software engineering fundamentals — you write clean, well-tested, production-ready code
- Hands-on experience with distributed systems at production scale
- Experience with at least one major cloud provider (AWS preferred)
- Familiarity with observability practices (SLOs, error budgets, alerting, dashboards)
- Track record of leading technical projects end-to-end, including cross-team coordination
- Comfortable using AI tooling and coding agents as part of your everyday engineering workflow — we expect our engineers to leverage these tools to move faster and think bigger
- Experience with cellular or multi-tenant architecture patterns
- Prior work on workflow orchestration systems (Temporal)
- Contributions to developer experience or internal platform tooling
- Experience in fintech, payments, or regulated industries (FedRAMP, SOC 2)