Operations Reliability Engineer

Job in Irvine, Orange County, California, 92713, USA
Listing for: Origence
Full Time position
Listed on 2026-02-28
Job specializations:
  • IT/Tech
    IT Support, Systems Engineer, Cloud Computing, SRE/Site Reliability
Salary/Wage Range or Industry Benchmark: 100,000 - 125,000 USD per year

Origence is always looking for diverse, talented people to join our exceptional team. Current job opportunities are posted here as they become available.

With 30 years at the forefront of fintech innovation, we specialize in SaaS lending solutions that lead the industry. Our core mission is customer-centric, focusing on empowering Credit Unions across the United States with the tools to offer accessible, competitive lending services. We're deeply committed to enhancing the financial ecosystem for a broad network of credit unions, members and auto dealers.

We invest in our greatest assets, our employees, and foster a culture of innovation and ownership through freedom and responsibility. We celebrate fiscal accountability, operational rigor and efficiency to create a sustainably healthy and robust business for the long term.

About you

You are a self-driven, conscientious, fiscally responsible, self-aware, passionate and compassionate professional. You are comfortable with ambiguity, eternally curious, and love problem solving. You operate as an owner and work with a growth mindset. You are extremely productive on your own, and act as a multiplier when collaborating with others. You are tireless in questioning the status quo and in pursuing the best answers to the hardest problems for the benefit of the business.

Your focus is strong, yet you are capable of context switching and pivoting with the business. Where there is a leadership vacuum, you step in to fill it.

The IT Operations Reliability Engineer is responsible for ensuring the stability, reliability, and operational readiness of enterprise systems. This role owns core IT operational functions, including incident response, change management, release readiness, and recurring operational reporting.

Operating in a DevOps-focused environment, this position requires strong independent execution, proactive risk identification, disciplined documentation, and clear, concise communication. Success in this role is measured by consistency, follow-through, and the ability to surface and address risks before they impact the business.

This is not a software development role, but it requires sound technical judgment, system-level thinking, and the ability to work closely with engineers to diagnose issues, mitigate risk, and improve overall system resilience. The role exists to establish operational reliability as a measurable, scalable discipline, reducing reactive incidents, improving resilience, and increasing organizational confidence as the platform grows.

What You’ll Be Doing:

Operational Ownership and Reliability:
  • Independently own recurring operational deliverables and reports, ensuring they are completed accurately and on schedule
  • Monitor system performance, availability, and reliability to maintain high uptime and service quality
  • Use observability tools (e.g., Datadog, Grafana) to identify trends, risks, and potential failure modes before they result in business impact
  • Define and evolve operational standards across IT Operations
  • Influence engineering roadmaps through data-driven operational insights
  • Establish, monitor, and refine service level objectives (SLOs) and error budgets aligned with business priorities and customer impact
  • Conduct trend analysis and systematic risk reviews to reduce repeat incidents and operational noise
  • Partner with engineering to prioritize reliability improvements based on incident patterns and performance data
Process Discipline and Continuous Improvement:
  • Maintain accurate shift notes, dashboards, and operational documentation that reflect current system health
  • Track and analyze KPIs related to uptime, performance, scalability, SLAs/SLOs, MTTA, and MTTR
  • Use operational metrics and observability data to identify systemic issues, recurring failure patterns, and opportunities for automation or resilience improvements
  • Define, measure, and report on reliability metrics including error budgets, availability targets, and service health indicators
  • Use operational data to guide trade-offs between feature velocity and long-term stability
Incident, Change and Release Management:
  • Lead blameless post-incident reviews focused on systemic remediation,…