×
Register Here to Apply for Jobs or Post Jobs. X

Systems Reliability Engineer

Job in New York, New York County, New York, 10261, USA
Listing for: mthree Recruiting Portal
Full Time position
Listed on 2026-03-05
Job specializations:
  • IT/Tech
    Cloud Computing, Systems Engineer, SRE/Site Reliability
Salary/Wage Range or Industry Benchmark: 100000 - 125000 USD Yearly USD 100000.00 125000.00 YEAR
Job Description & How to Apply Below
Location: New York

Market leading investment bank requires a Systems Reliability Engineer join their Reliability & Production Engineering department. This role supports Institutional Securities and Wealth Management brokerage Operations platforms which include diverse technologies hosted by on premises and cloud platforms. The role is expected to perform day to day support for the business alongside reliability engineering tasks. The role has an emphasis on improving the reliability of our systems by working with the Software developers and Infrastructure engineering teams to develop automated reliability solutions.

Responsibilities
  • You will spend time on production management, inclusive of: incident and problem management, capacity management, monitoring, event management, change management, and plant hygiene.
  • Troubleshooting issues across the entire technology stack: hardware, software, application, and network.
  • Participating in on-call rotation and periodic conference calls with other specialists from other time zones.
  • Proactively identifying and addressing system reliability risks.
  • Working closely with development teams to design, build, and maintain systems from a reliability, stability, and resiliency perspective.
  • Identifying and driving opportunities to improve automation for our platforms; scope and create automation for deployment, management, and visibility of our services.
  • Representing the RPE organization in design reviews and operational readiness exercises for new and existing products/services.
Experience
  • Demonstrated ability to troubleshoot problems and debug to identify root cause on large-scale distributed applications across multiple layers, i.e. software, Infrastructure and database.
  • Hands on experience on enterprise tools such as Prometheus, Grafana, Splunk, Apica
  • Hands-on experience of UNIX / Linux system support and Cloud based services.
  • Experience with Ansible, Git Hub or any automation/configuration/release management tools
  • Automation-related experience is particularly valued using scripting languages such as python, bash, perl, ruby. One higher level language is desired.
  • Creating stored procedures and optimising SQL in Sybase or DB2.
  • Experience of Azure Networks, Service Bus, Azure Virtual Machines and Azure

    SQL will be an advantage.
#J-18808-Ljbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary