Principal Technical Program Manager
Job in Salt Lake City, Salt Lake County, Utah, 84190, USA
Listed on 2026-01-20
Listing for: Oracle
Full Time position
Job specializations:
- IT/Tech: Systems Engineer, Cloud Computing, AI Engineer
Job Description & How to Apply Below
**Position Overview**

We are seeking a **Principal Technical Program Manager (TPM)** to join the **Network Reliability Engineering (NRE) organization** within **Oracle Cloud Infrastructure (OCI)**. This role is part of the **Availability PMO** and is pivotal in driving large-scale, cross-organizational programs that improve network resiliency, automation, and AI-driven operations.

The ideal candidate will have deep experience leading technical initiatives at scale - orchestrating work across product, engineering, and operations - while driving measurable improvements in reliability, availability, and efficiency. You will also help shape OCI's transformation toward **AI-powered and autonomous network operations**, collaborating closely with Network Engineering, GNOC, Automation, Monitoring and AI/ML product teams.

**Candidate Profile**

+ **6+ years of experience** driving complex technical programs across large-scale cloud or network environments (preferably with 2+ years in AI/ML or automation-related programs).
+ Proven experience leading initiatives in **cloud infrastructure, networking, or SRE/NRE** domains.
+ Demonstrated success managing **AI-enabled operations**, including predictive analytics, LLM-based knowledge systems, and self-healing automation.
+ Strong understanding of **cloud architecture**, networking fundamentals (routing, connectivity, monitoring, telemetry), and **data pipeline orchestration**.
+ Exceptional leadership and stakeholder management skills - able to influence across engineering, product, and operations at all levels.
+ Strategic thinker with strong analytical and problem-solving skills; able to turn ambiguous goals into measurable execution plans.
+ Excellent written and verbal communication skills, with the ability to synthesize complex technical topics for executive audiences.
+ Technical background with the ability to discuss APIs, ML workflows, data architectures, and automation frameworks with engineering teams.
+ Experience working in an **AI/Automation-driven operational environment** (e.g., AIOps, MLOps, network observability, or autonomous infrastructure) strongly preferred.

**Preferred Qualifications**

+ Bachelor's or Master's degree in Computer Science, Electrical Engineering, or related technical field.
+ Experience with **AI/ML product integration**, **LLM-based automation**, or **AI for Operations (AIOps)** tools.
+ Familiarity with **Terraform, Python, REST APIs, and cloud platforms (OCI, AWS, Azure, GCP)**.
+ Strong understanding of **operational metrics**, **incident lifecycle management**, and **continuous improvement processes**.

**Responsibilities**

**Key Responsibilities**

+ Lead large, complex, cross-functional programs that enhance OCI's network reliability, observability, and operational automation.
+ Define and execute multi-year program strategies focused on **AI-driven incident prediction, root cause analysis (RCA), and self-healing automation**.
+ Partner with NRE, Network Operations, and AI product teams to integrate **Machine Learning (ML), Large Language Models (LLMs), and predictive analytics** into operational workflows.
+ Drive end-to-end program execution - from requirements definition, design reviews, and data integration - through delivery, measurement, and continuous improvement.
+ Establish and track **Key Performance Indicators (KPIs)** for availability, MTTR, automation coverage, and RCA accuracy.
+ Develop and implement frameworks for **operational excellence**, program governance, and incident learning systems.
+ Communicate progress and outcomes to OCI executive leadership and key stakeholders, ensuring alignment with business priorities and customer commitments.
+ Proactively identify risks, dependencies, and bottlenecks across global teams and create mitigation and acceleration plans.

Disclaimer:

**Certain US customer or client-facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates.**

**Range and benefit information provided in this posting is specific to the stated locations only**…