×
Register Here to Apply for Jobs or Post Jobs. X

Infrastructure Site Reliability Engineer

Job in Rockford, Winnebago County, Illinois, 61103, USA
Listing for: Hispanic Alliance for Career Enhancement
Full Time position
Listed on 2026-03-01
Job specializations:
  • IT/Tech
    Cloud Computing, Systems Engineer, SRE/Site Reliability, Network Engineer
Salary/Wage Range or Industry Benchmark: 80000 - 100000 USD Yearly USD 80000.00 100000.00 YEAR
Job Description & How to Apply Below

We're building a world of health around every individual - shaping a more connected, convenient and compassionate health experience. At CVS Health®, you'll be surrounded by passionate colleagues who care deeply, innovate with purpose, hold ourselves accountable and prioritize safety and quality in everything we do. Join us and be part of something bigger - helping to simplify health care one person, one family and one community at a time.

Position

Summary

As an Infrastructure Site Reliability Engineer, you will be responsible for designing, implementing, and managing the infrastructure systems and tools that enable reliability and performance of our technology platforms supporting various business initiatives within CVS Health. This role requires a strong background in infrastructure engineering and a commitment to proactive monitoring, troubleshooting, and optimizing systems for maximum uptime and performance. Collaborating with diverse teams, you will prioritize high availability, scalability, and resilience to ensure our platforms and services consistently meet and exceed customer expectations.

Primary

Responsibilities
  • Operations:
    Manage and maintain various systems and infrastructure, such as servers, storage, mainframe, iSeries, backup, archive, and recovery, ensuring the platforms have high availability, scalability, and reliability to meet the business requirements. Participate in on‑call rotation to ensure availability and uptime of critical systems and provide timely response and resolution to incidents. Develop and maintain best practices documentation, including system architecture diagrams, standard operating procedures, and runbooks.

    Perform system and application performance analysis, utilizing monitoring tools, logging systems, and other relevant metrics, to identify and resolve issues and enhance overall system performance.
  • Process Improvement:
    Streamline and optimize operational processes, procedures, and documentation by implementing industry best practices. Develop, modify, and implement incident and problem management processes to increase efficiency and reduce downtime. Establish a comprehensive SRE process that encompasses the entire software team, ensuring seamless operations and prompt resolution of any escalated issues.
  • System Support:
    Collaborate with development teams to participate in code reviews, performance optimization, and application deployment processes. Drive reliability engineering practices, including monitoring, alerting, incident management, capacity planning, and disaster recovery. Automate infrastructure deployments, upgrades, and maintenance tasks, utilizing configuration management tools like Ansible and infrastructure‑as‑code frameworks such as Terraform. Stay abreast of industry trends, emerging technologies, and best practices in infrastructure site reliability engineering and apply knowledge to continually improve CVS Health's systems and processes.

    Provide customer support with meticulously documented procedures, enabling them to proficiently address customer complaints and deliver optimal service.
  • Capacity Management:
    Analyze historical usage patterns and growth projections to forecast future capacity requirements. Collaborate with stakeholders such as developers, product managers, and operations teams to understand the demand for resources and estimate the necessary infrastructure capacity. Establish and maintain monitoring systems to track the performance and utilization of critical resources. Identify potential bottlenecks, anomalies, or areas of improvement. Perform regular performance reviews help ensure systems meet defined service‑level objectives (SLOs) and key performance indicators (KPIs).
  • Required

    Qualifications
    • 7+ years of experience in Infrastructure Engineering, System Administration, or related roles.
    • 3+ years of experience with cloud platforms (e.g., Amazon Web Services, Microsoft Azure) and infrastructure‑as‑code tools (e.g., Terraform, Cloud Formation).
    • 2+ years of experience in at least one configuration management tool such as Ansible, Puppet, or Chef.
    • 2+ years of experience with containerization technologies such as Docker and…
    To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
    (If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
     
     
     
    Search for further Jobs Here:
    (Try combinations for better Results! Or enter less keywords for broader Results)
    Location
    Increase/decrease your Search Radius (miles)

    Job Posting Language
    Employment Category
    Education (minimum level)
    Filters
    Education Level
    Experience Level (years)
    Posted in last:
    Salary