×
Register Here to Apply for Jobs or Post Jobs. X

Site Reliability Engineer

Job in Alpharetta, Fulton County, Georgia, 30239, USA
Listing for: Now100
Full Time position
Listed on 2026-02-28
Job specializations:
  • IT/Tech
    Cloud Computing, IT Support, IT Project Manager
Salary/Wage Range or Industry Benchmark: 100000 - 125000 USD Yearly USD 100000.00 125000.00 YEAR
Job Description & How to Apply Below

The Wealth Management Production Management Site Reliability Engineer position is a highly visible/critical role, which will be a team member of technical SMEs managing the stability and optimization of the Wealth Management systems. Scope includes but not limited to, the day-to-day support of the organization’s technology related outages, collaboration on technology projects

focused on stability, optimization, business impact analysis, and associated risk-related methodologies. This role will be responsible for overall stability of the Wealth Management Investment Management application platforms, participation on key optimization initiatives, and collaboration with multiple technical teams within . Additionally, partner with WM business units, various levels of management and staff to collect, analyze and make recommendations on optimizing the platform.

This position will mainly perform Dev Ops/SRE role in Java, Unix & SQL technologies technology.

Responsibilities
  • Incident Management
    -Create and manage necessary process involving incidents
  • Partner with Ops Control to ensure IT and/or End User communications are handled appropriately
  • Engage with the development team throughout the life cycle to support Application build for Reliability
  • Develop software to automate manual operational work
  • Run, maintain and improve the service against established Service Level Objectives by applying software engineering principles
  • Responsible for the availability, performance, change (CP) management, monitoring, and capacity management of their services
  • Troubleshoot priority incidents, conduct blameless post-mortems and ensure permanent closure of the incidents
  • Analyze patterns of production incidents, develop permanent remediation plans, and implement automation to prevent future incidents from occurring through software engineering
  • Manage process related functions around large-scale events such as disaster recovery. Communicate closely with impacted groups to ensure all events are properly managed.
Primary Skills / Must have
  • Bachelor’s/Master’s Degree in Computer Science, Information Systems or related field
  • Proven track record supporting large scale multi-tiered cloud-based applications.
  • Hands on experience with Java, Angular, Spring, DB2, Unix scripting and experienced in scheduler tools such as TWS, autosys.
  • Experience working in an Agile Development environment
  • Proven ability to understand and troubleshoot complex problems under pressure
  • Excellent communication skills (both written and oral), listening skills, influencing and negotiation skills
  • Experience with performance troubleshooting and remediation
  • Experience with observability tools such as Splunk, Kibana, Grafana, Prometheus
#J-18808-Ljbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary