×
Register Here to Apply for Jobs or Post Jobs. X

Senior Site Reliability Engineer

Job in Kelowna, BC, U1X, Canada
Listing for: Orion Innovation
Full Time position
Listed on 2026-02-27
Job specializations:
  • IT/Tech
    Systems Engineer, Cloud Computing
Job Description & How to Apply Below

Overview

Senior Site Reliability Engineer (SRE) with Kubernetes and Rancher. Full-time role focused on building and maintaining highly resilient, secure systems, including in air-gapped environments.

Responsibilities
  • System Architecture & Management: Design, architect, and maintain highly reliable, multi-tenant systems using Kubernetes and related tools (RKE2). Includes components such as Ingress, Kong, Artifactory, and Sonar.
  • Observability & Monitoring: Implement and manage observability solutions with Prometheus, Grafana, Splunk, and Elastic to ensure deep visibility into system health and performance, including in air-gapped settings.
  • Compliance & Optimization: Ensure deployments meet stringent compliance standards and are optimized for performance and security.
  • Code Quality & Security: Perform regular code quality analysis and security assessments using Sonar to identify and mitigate vulnerabilities.
  • Incident Response: Collaborate with leads and specialized teams to resolve incidents quickly and improve resilience and recovery procedures.
  • Documentation: Create and maintain documentation for system configurations, runbooks, and disaster recovery plans for managing systems in sensitive environments.
Required

Skills and Qualifications
  • 8+ years of Site Reliability Experience.
  • Experience with Kubernetes and Rancher.
  • Technical Expertise: Proficiency with RKE2, Kubernetes, Ingress, Kong, Artifactory, Prometheus, Grafana, Splunk, Elastic, and Sonar.
  • SRE & Observability: Strong background in Site Reliability Engineering and implementing comprehensive observability strategies.
  • Secure Environments: Experience in air-gapped or zero-connectivity environments and protecting classified data.
  • Troubleshooting: Ability to troubleshoot and optimize complex, multi-tenant infrastructures under pressure.
Preferred Qualifications
  • Relevant SRE or Dev Ops certifications (e.g., CKAD, CKA).
  • Experience in government or defense-related SRE roles.
  • Experience with Rancher and its ecosystem.
Seniority level
  • Mid-Senior level
Employment type
  • Full-time
Job function
  • Engineering and Information Technology
Industries
  • IT Services and IT Consulting
#J-18808-Ljbffr
Position Requirements
10+ Years work experience
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary