SRE Application Support
Job Title:
SRE Application Support
Location:
Toronto, ON (4 days onsite per week)
Employment Type:
Contract
Pay Rate: CAD $50/HR INC
Interview Type:
Face-to-face (onsite interview only)
We are seeking an experienced Site Reliability Engineer (SRE) to join the Application Support team for a large-scale, mission-critical banking platform supporting Canada’s leading financial institution. The role focuses on ensuring high availability, stability and performance of enterprise applications running across Mainframe, OnDemand and AIX environments.
The ideal candidate will have strong production support experience, a proactive SRE mindset and the ability to work in a high-availability, regulated banking environment.
Key Responsibilities
- Provide Level 2 application support for critical banking applications running on Mainframe, AIX and OnDemand platforms.
- Monitor application health, system performance and batch processing to ensure 24x7 availability and reliability.
- Troubleshoot and resolve production incidents, perform root cause analysis (RCA) and implement preventive measures.
- Support Mainframe batch job scheduling and OnDemand report processing.
- Work closely with development, infrastructure and operations teams to ensure smooth releases and deployments.
- Participate in incident management, problem management and change management processes.
- Drive SRE best practices such as automation, resilience, monitoring and operational excellence.
- Create and maintain runbooks, SOPs and operational documentation.
- Provide on‑call support as part of a rotational support model.
Required Skills and Experience
- Strong experience in Application Support / SRE production support roles.
- Hands‑on experience with mainframe environments, batch processing, job monitoring and production support.
- Experience with IBM OnDemand or similar report management tools.
- AIX / Unix / Linux systems administration and troubleshooting.
- Experience supporting mission‑critical applications in large enterprise or banking environments.
- Strong understanding of incident, problem and change management processes.
- Ability to analyze logs, job failures and system alerts to quickly restore services.
- Excellent communication skills and ability to work with cross‑functional teams.
Preferred Qualifications
- Experience in the banking or financial services domain.
- Exposure to automation, scripting or monitoring tools.
- Familiarity with ITIL processes.
- Prior experience working in regulated, high-availability environments.
Disclaimer: AI tools may assist in the recruitment process; however, all hiring decisions are made by the recruitment team based on a comprehensive evaluation of candidates.