
Senior SRE Systems Engineer

Job in Urbandale, Polk County, Iowa, 50322, USA
Listing for: Berkley Technology Services
Full Time position
Listed on 2026-03-04
Job specializations:
  • IT/Tech
    Systems Engineer, Cloud Computing, SRE/Site Reliability, IT Project Manager
Job Summary:
Berkley Technology Services (BTS) provides technology solutions for W. R. Berkley Corporation, a Fortune 500 commercial lines insurance company. The Senior SRE Systems Engineer will play a crucial role in ensuring the reliability, scalability, and performance of software systems, collaborating closely with teams to enforce best practices and enhance overall efficiency.

Responsibilities:

• Define and track reliability and observability OKRs, including Service Level Objectives (SLOs) and Service Level Indicators (SLIs).

• Implement robust monitoring and alerting systems to proactively monitor health, identify potential issues, analyze system performance, and facilitate quick response to incidents.

• Implement AIOps functionality to enable auto-response, self-healing, and anomaly trend analysis.

• Drive the development and implementation of automation solutions to remove 'toil', streamline processes, reduce manual interventions, and enhance the overall efficiency of the product engineering and SRE teams.

• Identify and address performance bottlenecks in applications and infrastructure to improve efficiency and user experience.

• Work closely with incident management to quickly address and resolve system outages or performance issues to minimize downtime and impact on users.

• Collaborate actively with development and operations teams to implement observability and resiliency requirements in order to ensure smooth deployment and operation of software systems.

• Lead the coordination with product, development, infrastructure, and architecture teams to conduct capacity planning, ensuring that systems can handle current and future demand; anticipate growth and scalability requirements.

• Improve reliability by identifying and addressing gaps in our architecture, services, and tooling.

• Modernize the disaster recovery program for both on-premises and cloud-based Berkley solutions.

Qualifications:
Required:

• 5+ years of IT experience working with infrastructure support and development

• 5+ years of experience in Site Reliability Engineering and DevOps.

• Proficiency in scripting languages such as Python, Go, Bash, and/or JavaScript, and experience with shell scripting.

• Strong expertise in observability, monitoring, alerting, and logging tools (Dynatrace, Datadog, ELK Stack).

• Hands-on experience designing and implementing logging and monitoring architectures.

• Expertise in designing and implementing on-premises, cloud, and hybrid resiliency solutions (HA, AA, AP), disaster recovery, and business continuity planning.

• Deep understanding of cloud computing principles, including IaaS, PaaS, and SaaS models.

• Experience with Kubernetes and other auto-scaling tools and technologies, including proficiency with tools such as Helm and Prometheus for deployment and monitoring.

• Proficiency in leveraging GitOps with containerization technologies and CI/CD pipelines.

• Experience developing and implementing automated system reliability and performance solutions, including infrastructure automation and configuration management tools (GitHub Actions, Terraform, Ansible, Chef, Puppet).

• Solid understanding of security best practices in on-premises, cloud, and hybrid environments, along with network technologies.

• Understanding of industry-standard security frameworks and the ability to interpret them for Berkley environments.

• Ability to drive critical issues and system design discussions and moderate between multiple technology teams.

• Demonstrated leadership experience, including mentoring junior engineers and leading technical projects.

• Excellent problem-solving skills and the ability to troubleshoot complex issues in a distributed hybrid environment.

• Strong communication skills to collaborate effectively with cross-functional teams and convey technical concepts to non-technical stakeholders.

Company:
Berkley Technology Services offers networking, software development, UI/UX design, project management, and IT shared services. Founded in 2001, the company is headquartered in Wilmington, USA, with a team of 201-500 employees. The company is currently in the growth stage.
Position Requirements
10+ years of work experience