Site Reliability Engineer Job Atlanta area,Georgia USA,IT/Tech

Who is Rainforest?

Rainforest is an early stage payments-as-a-service startup that has developed a solution that makes monetizing payments for vertically focused software platforms fair and simple. We focus on small-to-mid sized platforms that want to add value to their small business customers through embedded payments — and want to do so without adding operational or regulatory burdens or costs. Led by a successful repeat fintech founder with deep knowledge of the industry, and with venture backing from a Top 10 VC firm, Rainforest is well positioned to make an impact in the payments-as-a-service space, and we're looking for new team members who want to be a part of this journey!

Who

We’re Looking For

We’re looking for a proactive, hands‑on Site Reliability Engineer who thrives in building and scaling cloud infrastructure in fast‑moving startup environments. You’re someone who enjoys owning systems end‑to‑end — from infrastructure design to production reliability — and partnering closely with engineers to ship secure, scalable payment platforms. You bring a strong technical foundation, a problem‑solving mindset, and a passion for automation, performance, and continuous improvement.

If you’re excited about making a real impact in fintech, and helping shape SRE practices as the company grows, you’ll feel right at home at Rainforest.

What are some of the high‑impact opportunities you’ll tackle?

Owning and scaling Rainforest’s Amazon Web Services (AWS)-based cloud infrastructure using Terraform and infrastructure-as-code (IaC) orchestration
Building, operating, and continuously improving Elastic Kubernetes Service (EKS) and serverless environments that support our core payments services
Designing and maintaining modern CI/CD pipelines with Git Lab to enable fast, safe deployments
Implementing and evolving monitoring, alerting, and observability to ensure high uptime and quick incident resolution using tools like Open Telemetry, Prometheus, and New Relic
Automating infrastructure and operational processes to eliminate manual work and accelerate delivery
Working side‑by‑side with application engineers to improve system performance, reliability, and scalability
Leading incident response efforts, conducting postmortems, and driving continuous improvement
Helping to define and roll out SRE best practices, including SLIs, SLOs, and error budgets as the company scales
Optimizing for cost, security, and compliance in a regulated fintech environment
Supporting and scaling Postgres database infrastructure using AWS RDS offerings (Global Aurora)

This opportunity is for you if you have / are:

3+ years of experience in SRE, Dev Ops, or cloud infrastructure roles (startup or high‑growth experience a plus)
Passion for building reliable systems that scale with the business
Strong hands‑on experience with cloud infrastructure (AWS, Google Cloud, Azure)
Deep experience with IaC using tools such as Terraform, Open Tofu, Terragrunt, and Cloud Formation
Solid production experience with container orchestration (Kubernetes, ECS)
Experience building CI/CD pipelines using tools like Git Lab and Git Hub Actions
Strong understanding of monitoring and observability principles and design and providing dashboards, visualizations and alerts
Proficiency in at least one modern programming language (e.g., Python, Java, Go, or Ruby)
Bachelor’s degree or equivalent work experience in the areas of Information Science, Computer Science, or related disciplines is preferred

We offer a comprehensive health benefits package, unlimited paid time off, paid parental leave, a fun and flexible working environment, and continuously invest in our people and our culture.

If you require any accommodations throughout the pre‑employment process, please contact our HR team at

#J-18808-Ljbffr


Increase/decrease your Search Radius (miles)



Job Posting Language