Sr. Site Reliability Engineer
Listed on 2026-02-28
-
IT/Tech
Cloud Computing, Systems Engineer, SRE/Site Reliability, Network Engineer
About the Role
At Pitch Book, a Morningstar company, we are always looking forward. We continue to innovate, evolve, and invest in ourselves to bring out the best in everyone. We're deeply collaborative and thrive on the excitement, energy, and fun that reverberates throughout the company.
Our extensive learning programs and mentorship opportunities help us create a culture of curiosity that pushes us to always find new solutions and better ways of doing things. The combination of a rapidly evolving industry and our high ambitions means there's going to be some ambiguity along the way, but we excel when we challenge ourselves. We're willing to take risks, fail fast, and do it all over again in the pursuit of excellence.
If you have a good attitude and are willing to roll up your sleeves to get things done, Pitch Book is the place for you.
About the RoleAs a member of the Product and Engineering team at Pitch Book, you will be part of a team of big thinkers, innovators, and problem solvers who strive to deepen the positive impact we have on our customers and our company every day. We value curiosity and the drive to find better ways of doing things. We thrive on customer empathy, which remains our focus when creating excellent customer experiences through product innovation.
We know that greatness is achieved through collaboration and diverse points of view, so we work closely with partners around the globe. As a team, we assume positive intent in each other's words and actions, value constructive discussions, and foster a respectful working environment built on integrity, growth, and business value. We invest heavily in our people, who are eager to learn and constantly improve.
Join our team and grow with us!
As a Sr. Site Reliability Engineer (SRE) in Pitch Book's engineering division, you will be creating and evolving systems to automatically run our suite of products and services reliably and consistently. As part of a team of site reliability engineers and platform engineers and in conjunction with group leadership, you will help define service level objectives (SLOs) that determine success and build systems to achieve those objectives.
You will utilize your strong background in deploying, managing, and maintaining production systems, working with developers to operate and monitor large-scale services with complex distributed systems and data integrations. You will incorporate observability tools (monitoring, telemetry, tracing, alerting), perform incident management, conduct root cause analyses, eliminate single points of failure, build reliability and redundancy into our infrastructure, establish and test our recoverability, mitigate failures, and do all of these things through automation and tools.
As a Sr. Site Reliability Engineer, you will take independent responsibility for building and managing large subsets of our systems. You will help build our best practices for infrastructure-as-code and your code will exemplify our quality controls. You will mentor and train other Site Reliability Engineers, platform engineers, and software engineers in reliability topics.
Your ability to collaborate with colleagues, exhibit poise and adaptability in stressful situations, communicate effectively, and build resilient systems that can be consistently relied upon will be critical to your success. You will solicit feedback, learn constantly, engage others with empathy, and help create a culture of belonging, teamwork, and purpose.
If you love building customer‑centric solutions, strive for excellence every day, are adaptable and focused, and believe work should be fun, come join us!
PrimaryJob Responsibilities
- Establish service level objectives (SLOs), error budgets, and service level indicators (SLIs) as success criteria that our systems and processes consistently meet or exceed.
- Build recoverability into our services and systems, including disaster recovery (DR), backups/recovery, and incorporation of multi‑AZ multi‑regional architecture into cloud constructs.
- Manage connectivity (CIDRs, VPCs, Subnets), latency, and availability across distributed systems.
- Establish clustering and load‑balancing…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).