Associate Principal, Infrastructure Engineering
Listed on 2026-03-01
-
IT/Tech
IT Support
***** THIS POSITION IS NOT ELIGIBLE FOR VISA SPONSORSHIP*****
What You'll Do:
The Associate Principal of Infrastructure Engineering is responsible for OCC's infrastructure engineering while driving continuous improvement across the production environment. This role combines hands-on technical expertise with strategic operational excellence, managing production administration of application scheduling and automation using Automic/UC4 software across AWS and on-premises platforms (Windows, MVS, Linux). The position serves as a technical evangelist for the best operational practices, working cross-functionally with Production Support, Release Management, Platform Services, Development, Testing, and Business Operations teams to shift the organization from reactive to proactive operations.
Primary Duties and Responsibilities:
To perform this job successfully, an individual must be able to perform each primary duty satisfactorily.
Infrastructure Engineering & Production Administration
Development Support:
Code and maintain uc4 objects for development and testing teams; provide first-level support for non-production environmentsProduction Support:
Provide second-level troubleshooting for scheduling issues in production environmentsAutomic Administration:
Perform basic Automic/UC4 system administration including starting/stopping the Automic system and agentsSecurity Management:
Maintain accurate inventory of all departmental owned logon IDs and passwords in Cyber ArkDisaster Recovery:
Provide support for all disaster recovery tests and exercises
Enterprise Scheduling & Automation
Complex Scheduling:
Develop, maintain, and optimize job schedules using Automic/UC4 software across multiple AWS cloud and on-premises platforms (Windows, MVS, Linux), managing multi-job dependencies and automated workflowsScripting & Automation:
Utilize Automic scripting language, Python, Power Shell, and other scripting tools to automate variable passing, job execution, and test environment setup; use REGEX for pattern matching and validationFile Transmission:
Coordinate and test new file transmissions with exchanges and members using Connect Direct, Sterling Integrator, and FTP protocolsBatch Processing:
Prepare and maintain batch jobs and REST API calls
Operational Excellence & Continuous Improvement
Proactive Operations:
Lead the shift from reactive to proactive operational posture by identifying and addressing issues before they impact productionPerformance Analysis:
Analyze and report on production performance, capacity planning, and critical-path processing opportunitiesMetrics & KPIs:
Develop, monitor, and report Key Performance Indicators to maintain compliance and drive measurable improvements through trend analysisProblem Resolution:
Identify and diagnose complex problems affecting production performance; stand up SWAT teams when needed to drive resolution of ongoing issuesCross-Domain
Collaboration:
Work across network, database, storage, and application teams to assist with tuning and optimizationRisk Management:
Surface environmental and operational risks; analyze repeating alerts to proactively identify issuesITIL Leadership:
Actively participate in and shepherd Incident, Problem, and Change Management processes using Service Now; ensure adherence to ITIL best practicesProcess Automation:
Evangelize for and implement repeatable, scalable automated processesCapacity Planning:
Forecast system demands and recommend upgrades, expansions, and reconfigurations
Documentation, Communication & Stakeholder Management
Documentation:
Maintain updated procedures on all supported products; create comprehensive process documentationStatus Reporting:
Provide daily status reports to management; attend project and status meetings as requiredKnowledge Sharing:
Cross-train team members and stakeholders; deliver training on new product releases and best practicesVendor Coordination:
Manage vendor support relationships and drive issue resolutionConsultation:
Serve as a consultant and evangelist for operational best practices across the organization
Additional Responsibilities
On-Call Support:
Provide on-call and/or on-site support for installs, production issues, and system availabilityOff-Hours Work:
Participate in after-hours and weekend maintenance windows as requiredTroubleshooting:
Perform complex hardware and software troubleshooting, taking corrective actions or coordinating with IT staff and vendors
Supervisory Responsibilities:
None
Qualifications:
The requirements listed are representative of the knowledge, skill, and/or ability required. Reasonable accommodations may be made to enable individuals with disabilities to perform the primary functions.
Required:
Strongly customer service oriented with understanding of service level agreement urgency
Excellent consultative, communication, analytical, and judgment skills
Strong problem-solving and decision-making abilities with capacity to perform well under pressure
Highly detail-oriented with strong time…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).