Infrastructure Observability Engineer
Vancouver, BC, Canada
Listed on 2026-02-28
IT/Tech
Cloud Computing, Systems Engineer, Cybersecurity, IT Support
Our Story & Purpose
We’re Vancity, a member-owned credit union built on the principles of inclusion and social justice. Since 1946, our relentless commitment to these values has helped us challenge the status quo and break down barriers. We’ve made bold commitments to become net-zero by 2040 across all mortgages and loans, and we’re actively pursuing strategies in Indigenous banking and financial resilience for our members.
As the largest private sector Living Wage Employer in Canada, we’re proud to be consistently recognized as one of the country’s Top Employers. If you’re ready to join our team of 2,700 diverse individuals, access competitive rewards and benefits, and be part of a greater movement, apply today!
Your Role In Supporting Our Members
As an Infrastructure Observability Engineer, you will play a critical role in ensuring a smooth and reliable digital experience for our members by designing and implementing monitoring capabilities across our products and services. You will bring extensive cloud and on‑premises infrastructure knowledge and skills, collaborating with cross‑functional teams to implement monitoring and to improve insight into Vancity’s services and platforms. You are responsible for integrating observability into the environment (infrastructure and applications), building custom solutions, automation and queries, and configuring dashboards to visualize data.
We are looking for a proactive, technically skilled individual who thrives in problem solving and continuous learning.
You must have a genuine passion for crafting intuitive, business‑aligned Grafana dashboards that are sleek, purposeful and tell the story of system health at a glance.
This is a full‑time, permanent role with a hybrid working arrangement, split primarily between the Vancity head office and your home office in the Lower Mainland. You will be required to work on‑site at least two days a week, as well as for events and business demands. This requirement may change in the future based on business needs.
How You’ll Make An Impact
- Design, develop, test and deploy observability solutions to monitor performance of services provided to our members.
- Perform relevant code and configuration reviews with team members, ensuring code quality, accurate metric calculation and adherence to industry best practices.
- Provide recommendations for improving product observability, solving issues related to metric collection and working with the product team to define technical observability requirements for future products/services.
- Share observability technical requirements for integration with vendor and partner systems and, as a subject matter expert, provide guidance to cross‑functional teams on implementing features.
- Create observability standards and track metrics to measure performance.
- Build end‑to‑end visibility via Azure Monitor, Log Analytics, Managed Grafana and Application Insights.
- Write queries (KQL), build intuitive dashboards, alerts and automated response rules.
- Improve detection accuracy, reduce alert noise and extend monitoring coverage to all critical services.
- Collaborate with cross‑functional teams such as products, pods and command center on service delivery, offering guidance and conducting troubleshooting sessions to unblock progress and ensure timely completion of work tasks and/or incident recovery.
- Act as an emissary of the Cloud Engineering team, promoting strong partnerships with cross‑functional teams and stakeholders that depend on our services.
- Extensive experience in an enterprise IT environment or the financial sector, with a bachelor’s degree (or equivalent) in computer science, engineering or a related field.
- Strong hands‑on coding experience in Node.js or JavaScript and PowerShell; experience with custom plugins, adapters or serverless is an advantage.
- Experience with API integration and RESTful APIs, and deep knowledge of TCP/IP, HTTP/S and containers.
- Extensive experience with GCP, AWS or Azure, the Kubernetes ecosystem and on‑premises/data center infrastructure.
- Familiarity with Git or other version control systems in collaborative development environments.
- Prior…