Senior Infrastructure Engineer
Listed on 2026-02-27
-
IT/Tech
Systems Engineer, Cloud Computing
Position Overview
At PNC, our people are our greatest differentiator and competitive advantage in the markets we serve. We are all united in delivering the best experience for our customers. We work together each day to foster an inclusive workplace culture where all of our employees feel respected, valued and have an opportunity to contribute to the company’s success. As a Senior Infrastructure Engineer within PNC's Retail organization, you will be based in Denver, CO, Strongsville, OH or Pittsburgh, PA.
PNC is an in‑office company that fosters a supportive culture where employees can thrive and achieve balance. We encourage candidates to connect with their recruiter and hiring manager to understand workplace expectations and ensure the role aligns with their goals.
PNC will not provide sponsorship for employment visas or participate in STEM OPT for this position.
Key ResponsibilitiesPlatform & Containerization Expertise
- Serve as the Subject Matter Expert (SME) for Open Shift Kubernetes, including on‑prem cluster utilisation and application namespace resource management.
- Drive container resource optimisation for applications.
Infrastructure & Capacity Engineering
- Support enterprise‑wide capacity forecasting, resource planning, and reclamation efforts across compute, storage, and network layers.
- Review and optimise converged infrastructure solutions to support large‑scale enterprise workloads and automated orchestration.
- Drive resource optimisation practices, ensuring efficient utilisation of containers, JVM resources, and underlying infrastructure components.
Application Runtime & Performance Optimization
- Provide deep expertise in JVM configuration, tuning, and Garbage Collection (GC) optimisation to improve system responsiveness and stability.
- Partner with software development teams to guide application performance optimisation aligned with platform capabilities and best engineering practices.
Nonfunctional Requirements (NFR) Leadership
- Translate and enforce NFRs including performance, throughput, security, scalability, resiliency, stability, and operational continuity.
- Assess architecture and solution designs for resiliency, ensuring compliance with enterprise standards for failover, high availability, and Disaster Recovery readiness.
Reliability & Operational Excellence
- Develop and maintain dashboards, tools, and frameworks for operational health monitoring, resource consumption metrics, and capacity forecasting.
- Future scope:
Lead resiliency engineering initiatives such as chaos testing and failure injection to proactively identify and mitigate system weaknesses (not currently in scope).
Education, Training & Developer Guidance
- Deliver training and platform guidance via one‑on‑one consultations, office hours, lunch‑and‑learns, and technical roadshows.
- Produce high‑quality technical documentation, instructional materials, Confluence articles, and presentations.
Technical Expertise
- Expert‑level knowledge of Open Shift Kubernetes administration, including on‑prem cluster management.
- Strong capability in capacity management across compute, storage, and network stacks.
- Experience engineering converged infrastructure solutions for enterprise workloads.
- Proven skill in resource optimisation, including container efficiency, JVM tuning, and platform performance improvements.
- Deep knowledge of JVM internals, GC algorithms, tuning strategies, and performance analysis tooling.
- Ability to partner with developers on application‑level performance tuning and system‑to‑application alignment.
- Strong understanding of NFR frameworks—performance, scalability, security, resiliency, and operational continuity.
Communication & Leadership Skills
- Proven presenter and facilitator with experience delivering training via consultations, workshops, roadshows, and office hours.
- Ability to produce clear, concise, high‑quality technical documentation and instructional content.
- Open Shift, Kubernetes, Operators, Helm, Istio / Service Mesh
- Linux (RHEL), Ansible, Git, CI/CD pipelines, Terraform
- JVM technologies (OpenJDK, Hotspot, GC algorithms such as G1, ZGC)
- Splunk, Dynatrace, Prometheus, Grafana,…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).