Site Reliability Engineer Remote
Remote / Online - Candidates ideally in
Quebec, Québec, Province de Québec, Canada
Listed on 2026-02-24
Listing for:
Braintrust
Remote/Work from Home position
Job specializations:
- IT/Tech: Cloud Computing, SRE/Site Reliability, IT Support, IT Project Manager
Location: Quebec
Job Description
We’re building a secure, reproducible foundation for software everywhere, one that makes building, maintaining, and releasing software predictable and effortless for everyone. We’re looking to carefully grow our team with people who are excited about building a workflow that feels light years ahead of everything else.
Our technology stack is built on GCP and Cloudflare using Go, Rust, TypeScript, Postgres, Linux, and container images.
Your work will directly shape the way developers experience and trust our product, and influence the way we grow as a team.
Please Note: Salary range (150,) is in CAD per year.
The Role
- Managing our cloud services on GCP and Cloudflare
- Ensuring we meet our SLOs
- Managing monitoring and logging systems
- Maintaining a strong security posture
- Managing our CI/CD systems
- Managing incident response and on-call
- Automating all-the-things
- Helping manage our Google Workspace and other third-party SaaS systems
- Develop and maintain automated pipelines for building, testing, and deploying code.
- Provision, manage, and scale infrastructure on platforms like GCP and Cloudflare.
- Third‑party SaaS management
- Implement observability tools to monitor system health, performance, and set up alerts.
- Manage all incident response: troubleshoot, resolve, and conduct root‑cause analysis for production issues, and write incident-response processes and documentation that will scale as we grow.
- Craft and manage a robust on‑call system that will scale as the company grows
- Script and build tools to automate operational tasks
- Implement security best practices across infrastructure and applications.
- Create and maintain clear documentation for processes and systems.
- Work closely with development teams to optimize designs, ensure smooth releases and reliable applications.
Requirements
- 5+ years of experience in a similar role
- Experience with GCP
- Proficiency in Python, Bash, or Shell scripting.
- Experience with CI/CD tools like Jenkins, GitLab CI/CD, or GitHub Actions.
- Experience with Docker and Kubernetes.
- You champion Infrastructure as Code and have experience with Terraform and Ansible.
- Strong working knowledge of Prometheus, Grafana, and the ELK Stack.
- Strong understanding of OSes (Linux), networking, and distributed systems.
Nice to Have
- Experience at an early‑stage startup
- Background in security, developer tooling, open source and package management
Benefits
- Competitive base compensation
- Equity stake in the company
- Health, dental, vision and life insurance plans
- Retirement plans with employer matching
- Remote‑first work environment
- Flexible PTO with a minimum suggested utilization of 3 weeks