Site Reliability Engineer
Location: Canada - Remote or Hybrid (office in downtown Toronto)
Type: Full-time | Permanent
A long-standing, privacy-focused technology company operating large-scale global infrastructure is hiring a Site Reliability Engineer to help strengthen and scale its VPN and DNS platforms. This is a hands-on, high-impact role at the intersection of networking, distributed systems, and systems engineering - ideal for someone who’s seen their share of deep debugging, edge cases, routing quirks, and DNS mysteries.
You’ll work across global points of presence, modern DNS systems, routing and anycast configurations, Linux servers, and on-call response. The mission: ensure fast, reliable, uncensored internet access for users worldwide while improving automation, documentation, and operational excellence across a complex distributed environment.
What You’ll Do- Own Linux system administration, performance tuning, and troubleshooting
- Manage and improve global networking architecture, including routing, anycast, and VPN infrastructure
- Diagnose and resolve DNS issues across diverse environments and operating systems
- Lead and automate operational workflows using Python, Bash, and configuration management tools
- Support new software launches and ensure smooth deployment into production
- Participate in a critical on-call rotation, handling incident response with autonomy
- Build and maintain clear, robust documentation, checklists, and operational processes
- Strengthen system reliability through better monitoring, validations, and automation
- Strong background in networking fundamentals (OSI, TCP/IP, UDP/TCP, routing, subnetting)
- Deep understanding of DNS: record types, resolution paths, server roles, caching, DNSSEC, etc.
- Fluency with packet-level troubleshooting (tcpdump, Wireshark, termshark)
- Experience with routing and anycast; familiarity with BGP is a major plus
- Proven ability to work independently and deliver high-quality operational outcomes
- Hands-on experience with Python and Bash for automation
- Comfort supporting a global infrastructure environment and participating in on-call work
- Experience with Power
DNS, BIND, Unbound, or similar DNS servers - Exposure to Prometheus, PromQL, and Loki for observability
- Familiarity with VyOS, Juniper, or other networking systems
- Experience with Git Lab CI/CD or similar automation pipelines
- Background working with VPN, security, or distributed networking products
- Impact: Direct ownership over systems that millions of users rely on globally
- Depth: Work on real networking challenges — BGP, anycast, DNS resolution, global routing
- Autonomy: High trust environment with room to architect, improve, and innovate
- Culture: Low-ego, engineering-first team with flexible work and strong technical standards
- Mission: Contribute to technologies that support privacy, security, and open internet access
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search: