OpenShift DevOps Engineer
Listed on 2026-03-09
-
IT/Tech
Systems Engineer, Cloud Computing
Outstanding long-term contract opportunity! A well-known Financial Services Company is looking for a Open Shift Dev Ops Engineer – in Charlotte, NC. Phoenix, AZ. Chandler, AZ. or Irving, TX. (Hybrid).
Work with the brightest minds at one of the largest financial institutions in the world. This is a long-term contract opportunity that includes a competitive benefit package! Our client has been around for over 150 years and is continuously innovating in today’s digital age. If you want to work for a company that is not only a household name, but also truly cares about satisfying customers’ financial needs and helping people succeed financially, apply today.
Contract Duration: 12 Months
The Open Shift Container Platform Engineer operates within the IT Operations (IT Ops) model and is responsible for the stability, availability, lifecycle management, and continuous improvement of the enterprise Kubernetes platform.
This role supports business-as-usual (BAU) operations, cluster repair and recovery, platform readiness, patching, and version upgrades across environments. The engineer partners with Infrastructure, Security, Network, and Application Support teams to ensure the container platform meets enterprise service level objectives (SLOs), compliance requirements, and operational standards.
A strong automation mindset is required, with hands-on Python experience to drive operational efficiencies, deployment enhancements, and platform reliability improvements.
- Incident & Major Incident Management
- Problem Management & Root Cause Analysis
- Change & Release Management
- Capacity & Availability Management
- Configuration & Patch Management
- Platform Lifecycle Governance
1. Platform Operations (BAU Support)
- Provide L2/L3 operational support for Red Hat Open Shift clusters.
- Monitor and maintain Kubernetes cluster health, performance, and availability.
- Respond to incidents, service requests, and platform escalations.
- Perform root cause analysis and implement corrective/preventative actions.
- Maintain operational documentation and runbooks.
- Troubleshoot and repair degraded or failed cluster components (nodes, control plane, etcd).
- Execute node recovery, certificate rotation, and infrastructure remediation.
- Validate disaster recovery readiness and high-availability configurations.
- Support resilience testing and failover exercises.
- Plan and execute Open Shift and Kubernetes version upgrades.
- Apply platform patches, security updates, and configuration changes in accordance with Change Management processes.
- Perform impact analysis and backout planning for platform changes.
- Ensure production readiness validation prior to releases.
- Ensure environments (Dev, QA, Prod) meet operational readiness standards.
- Validate compliance with security baselines and hardening guidelines.
- Maintain cluster capacity and scaling strategies aligned with demand.
- Partner with Security and Risk teams for vulnerability remediation.
- Develop Python automation for:
- Cluster provisioning and validation
- Health checks and diagnostics
- Deployment improvements
- Repetitive operational tasks
- Reduce manual intervention through scripting and workflow automation.
- Integrate automation into CI/CD and operational tooling.
- Continuously improve reliability through automation-driven controls.
- 5 years in Infrastructure or IT Operations engineering roles.
- 3 years hands-on experience with:
- Red Hat Open Shift Container Platform
- Kubernetes container orchestration
- Experience supporting production enterprise platforms within an IT Ops model.
- Strong Linux (RHEL preferred) administration skills.
- Proven experience performing:
- Cluster upgrades
- Patching and lifecycle management
- Incident response and RCA
- Proficiency in Python for operational automation and deployment enhancements.
- Experience working within ITIL-based service management frameworks.
- Hybrid or multi-cluster enterprise environments.
- Experience with enterprise monitoring and logging platforms.
- Familiarity with Git Ops and CI/CD integration.
- Open Shift or Kubernetes certifications.
- Experience supporting regulated or high-availability environments.
- Platform availability and uptime targets achieved
- Reduction in manual operational effort through automation
- Successful execution of upgrades and patches with minimal disruption
- Improved MTTR (Mean Time to Resolution)
- High platform reliability and operational readiness
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).