×
Register Here to Apply for Jobs or Post Jobs. X

Senior Site Reliability Engineer, Observability

Job in Bogota, Bergen County, New Jersey, 07603, USA
Listing for: Chainlink Labs
Full Time position
Listed on 2026-03-01
Job specializations:
  • IT/Tech
    Systems Engineer, Cloud Computing, IT Support, SRE/Site Reliability
Salary/Wage Range or Industry Benchmark: 80000 - 100000 USD Yearly USD 80000.00 100000.00 YEAR
Job Description & How to Apply Below
Location: Bogota

Senior Site Reliability Engineer, Observability

Join Chainlink Labs as a Senior Site Reliability Engineer focused on Observability. The role supports our engineering teams by building a modern, OTEL‑based observability platform, driving reliability, security, and performance across a rapidly growing suite of blockchain services.

About Chainlink

Chainlink is the industry‑standard oracle platform bringing capital markets onchain and powering the majority of decentralized finance (DeFi). The Chainlink stack provides the essential data, interoperability, compliance, and privacy standards required for institutional tokenized assets, lending, payments, stable coins, and more. Since inventing decentralized oracle networks, Chainlink has enabled tens of trillions in transaction value and now secures the vast majority of DeFi.

About

the Observability Team

The Observability Team enables Chainlink development and empowers engineers to continue building and supporting crucial products and services that have a profound impact in the blockchain industry. Reliability is vital to the success of our company. As a Senior SRE you will accelerate and enable other engineering teams by increasing self‑service and decreasing cognitive load.

Your Impact
  • Build and orchestrate a modern OTEL‑based observability platform.
  • Support multiple telemetry types such as metrics, logs and traces.
  • Define and enforce governance for observability and problem management at scale.
  • Ensure reliability, security, and performance exceed defined SLAs.
  • Collaborate with engineers across the company to troubleshoot issues, deploy new products, and increase velocity while reducing cognitive load.
  • Lead the design and deployment of monitoring/observability services to detect and alert the team of needed actions.
  • Ingest, aggregate, transform, and utilize data from a multitude of sources in our real‑time data pipeline.
  • Oversee the availability, performance, and supportability of our observability infrastructure.
  • Create processes around alert‑response operations and support the team to ensure reliable delivery of oracle data.
  • Make recommendations to ensure sufficient metrics are collected to create alerts with every new feature release.
  • Champion reliability and security by taking the time to do your work right the first time.
Requirements
  • 7+ years of relevant professional experience; typically on devops, infrastructure, SRE, or platform teams.
  • Ability to develop software beyond typical infrastructure configurations.
  • Experience programming in C, C++, Java, Python, Go, Perl, or Ruby.
  • Expert knowledge in designing, developing, and managing large real‑time systems.
  • Experience with monitoring and logging: exporting metrics with Prometheus, building Grafana dashboards, and using a centralized logging solution such as ELK Stack, Splunk, or Grafana Stack.
  • Experience with distributed systems and container orchestration, including maintaining or building Kubernetes clusters and deploying new services on them.
  • Strong communication skills; capable of giving and receiving constructive feedback and participating in planning meetings and code reviews.
Desired Qualifications
  • Excitement for blockchain, Web 3.0, and similar decentralized technologies.
  • Experience running any infrastructure in the blockchain/web3 space.
  • Ability to scale systems sustainably through automation and evolution of reliability and velocity.
  • Experience working remotely in a distributed team.
  • A strong desire to grow and challenge yourself by continuously improving and automating services to reduce toil.
Tools and Services we use daily
  • AWS, Terraform/Terragrunt, Kubernetes, Calico, ArgoCD, Prometheus, Grafana, Git Hub Actions, Packer.
  • Comfortable and proficient use of the above tools is expected.
Commitment to Equal Opportunity

Chainlink Labs is an equal opportunity employer. All qualified applicants will receive equal consideration for employment in compliance with applicable laws, regulations, or ordinances. If you need assistance or accommodation due to a disability or special need when applying for a role or in our recruitment process, please contact us via the designated form.

Global Data Privacy Notice for Job Candidates…
Position Requirements
10+ Years work experience
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary