More jobs:
Systems Reliability Engineer
Job in
New York, New York County, New York, 10261, USA
Listed on 2026-03-05
Listing for:
mthree Recruiting Portal
Full Time
position Listed on 2026-03-05
Job specializations:
-
IT/Tech
Cloud Computing, Systems Engineer, SRE/Site Reliability
Job Description & How to Apply Below
Market leading investment bank requires a Systems Reliability Engineer join their Reliability & Production Engineering department. This role supports Institutional Securities and Wealth Management brokerage Operations platforms which include diverse technologies hosted by on premises and cloud platforms. The role is expected to perform day to day support for the business alongside reliability engineering tasks. The role has an emphasis on improving the reliability of our systems by working with the Software developers and Infrastructure engineering teams to develop automated reliability solutions.
Responsibilities- You will spend time on production management, inclusive of: incident and problem management, capacity management, monitoring, event management, change management, and plant hygiene.
- Troubleshooting issues across the entire technology stack: hardware, software, application, and network.
- Participating in on-call rotation and periodic conference calls with other specialists from other time zones.
- Proactively identifying and addressing system reliability risks.
- Working closely with development teams to design, build, and maintain systems from a reliability, stability, and resiliency perspective.
- Identifying and driving opportunities to improve automation for our platforms; scope and create automation for deployment, management, and visibility of our services.
- Representing the RPE organization in design reviews and operational readiness exercises for new and existing products/services.
- Demonstrated ability to troubleshoot problems and debug to identify root cause on large-scale distributed applications across multiple layers, i.e. software, Infrastructure and database.
- Hands on experience on enterprise tools such as Prometheus, Grafana, Splunk, Apica
- Hands-on experience of UNIX / Linux system support and Cloud based services.
- Experience with Ansible, Git Hub or any automation/configuration/release management tools
- Automation-related experience is particularly valued using scripting languages such as python, bash, perl, ruby. One higher level language is desired.
- Creating stored procedures and optimising SQL in Sybase or DB2.
- Experience of Azure Networks, Service Bus, Azure Virtual Machines and Azure
SQL will be an advantage.
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×