Site Reliability Engineer
Listed on 2026-02-28
-
IT/Tech
Cloud Computing, SRE/Site Reliability, Systems Engineer
Who is Rainforest?
Rainforest is an early stage payments-as-a-service startup that has developed a solution that makes monetizing payments for vertically focused software platforms fair and simple. We focus on small-to-mid sized platforms that want to add value to their small business customers through embedded payments — and want to do so without adding operational or regulatory burdens or costs. Led by a successful repeat fintech founder with deep knowledge of the industry, and with venture backing from a Top 10 VC firm, Rainforest is well positioned to make an impact in the payments-as-a-service space, and we're looking for new team members who want to be a part of this journey!
WhoWe’re Looking For
We’re looking for a proactive, hands‑on Site Reliability Engineer who thrives in building and scaling cloud infrastructure in fast‑moving startup environments. You’re someone who enjoys owning systems end‑to‑end — from infrastructure design to production reliability — and partnering closely with engineers to ship secure, scalable payment platforms. You bring a strong technical foundation, a problem‑solving mindset, and a passion for automation, performance, and continuous improvement.
If you’re excited about making a real impact in fintech, and helping shape SRE practices as the company grows, you’ll feel right at home at Rainforest.
- Owning and scaling Rainforest’s Amazon Web Services (AWS)-based cloud infrastructure using Terraform and infrastructure-as-code (IaC) orchestration
- Building, operating, and continuously improving Elastic Kubernetes Service (EKS) and serverless environments that support our core payments services
- Designing and maintaining modern CI/CD pipelines with Git Lab to enable fast, safe deployments
- Implementing and evolving monitoring, alerting, and observability to ensure high uptime and quick incident resolution using tools like Open Telemetry, Prometheus, and New Relic
- Automating infrastructure and operational processes to eliminate manual work and accelerate delivery
- Working side‑by‑side with application engineers to improve system performance, reliability, and scalability
- Leading incident response efforts, conducting postmortems, and driving continuous improvement
- Helping to define and roll out SRE best practices, including SLIs, SLOs, and error budgets as the company scales
- Optimizing for cost, security, and compliance in a regulated fintech environment
- Supporting and scaling Postgres database infrastructure using AWS RDS offerings (Global Aurora)
- 3+ years of experience in SRE, Dev Ops, or cloud infrastructure roles (startup or high‑growth experience a plus)
- Passion for building reliable systems that scale with the business
- Strong hands‑on experience with cloud infrastructure (AWS, Google Cloud, Azure)
- Deep experience with IaC using tools such as Terraform, Open Tofu, Terragrunt, and Cloud Formation
- Solid production experience with container orchestration (Kubernetes, ECS)
- Experience building CI/CD pipelines using tools like Git Lab and Git Hub Actions
- Strong understanding of monitoring and observability principles and design and providing dashboards, visualizations and alerts
- Proficiency in at least one modern programming language (e.g., Python, Java, Go, or Ruby)
- Bachelor’s degree or equivalent work experience in the areas of Information Science, Computer Science, or related disciplines is preferred
We offer a comprehensive health benefits package, unlimited paid time off, paid parental leave, a fun and flexible working environment, and continuously invest in our people and our culture.
If you require any accommodations throughout the pre‑employment process, please contact our HR team at
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).