Lead Site Reliability Engineer- Application Support - Technology Support Lead
Listed on 2026-01-15
-
IT/Tech
IT Support, Systems Administrator
Propel operational success with your expertise in technology support and a commitment to continuous improvement.
As a Technology Support lead in Corporate Technology, you will ensure the operational stability, availability, infrastructure mgmt. and performance of our production application flows. Encourage a culture of continuous improvement as you troubleshoot, maintain, identify, elevate, and resolve production service interruptions for all internally and externally developed systems, leading to a seamless user experience. Critical thinking while overseeing day-to-day maintenance of the firm’s systems will be key and set you up for success as you navigate tasks related to identifying, troubleshooting, and resolving issues to ensure a seamless user experience.
Jobresponsibilities
- Provides end-to-end application and infrastructure service delivery to enable successful business operations of the firm
- Supports the day-to-day maintenance of the firm’s systems to ensure operational stability and availability
- Execute policies and procedures that ensure operational stability and availability
- Analyze complex situations and trends to anticipate and solve incident, problem, and change management in support of full stack technology systems, applications, or infrastructure.
- Design system automation, build helpers and tools to reduce the human toil of application support
- Monitor production environments for anomalies, address issues, and drive evolution of utilization of standard observability tools
- Escalate and communicate issues and solutions to the business and technology stakeholders, actively participating from incident resolution to service restoration
- Lead incident, problem, and change management in support of full stack technology systems, applications, or infrastructure
- 8+ years of experience or equivalent expertise troubleshooting, resolving, and maintaining information technology services
- Experience managing applications or infrastructure in a large-scale technology environment both on premises and public cloud
- Experience in observability and monitoring tools and techniques, setting them up for large scale applications and services.
- Incident management – experienced in production incident management, root cause analysis (RCA) and related follow ups to ensure closure of the issue
- Role implies collaboration with all levels of seniority including the developers, business users and global production management teams.
- Hands on experience on scripting language, any programming language and debugging for production issue and system automations.
- Hands on experience with private/public cloud, Kubernetes or equivalent.
- Strong understanding of infrastructure costing, with the ability to apply tuning mechanisms that optimize both performance and cost efficiency.
- Experience with Service Now platforms
- Understanding of software life cycle development
- Knowledge of document management system
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).