Senior Site Reliability Engineer, Observability

Framework Ventures

Full Timesenior

Bella Coola, British Columbia, CAPosted February 23, 2026

Resume Keywords to Include

Make sure these keywords appear in your resume to improve ATS scoring

PythonJavaGoRubySwiftAWSKubernetesTerraformGitHub ActionsGitHubDevOps

Job Description

Overview

About Chainlink

Chainlink is the industry-standard oracle platform bringing the capital markets onchain and powering the majority of decentralized finance (DeFi). The Chainlink stack provides essential data, interoperability, compliance, and privacy standards needed to power advanced blockchain use cases for institutional tokenized assets, lending, payments, stablecoins, and more. Chainlink has enabled tens of trillions in transaction value and now secures the vast majority of DeFi. Many of the world’s largest financial services institutions have adopted Chainlink’s standards and infrastructure, including Swift, Euroclear, Mastercard, Fidelity International, UBS, S&P Dow Jones Indices, FTSE Russell, WisdomTree, ANZ, and top protocols such as Aave, Lido, GMX and many others. Chainlink leverages a novel fee model where offchain and onchain revenue from enterprise adoption is converted to LINK tokens and stored in a strategic Chainlink Reserve. Learn more at chain.link.

The Observability Team enables Chainlink development and empowers engineers to continue building and supporting crucial products and services that have a profound impact in the blockchain industry. Reliability is vital to the success of our company. As a Senior SRE, you will help us accelerate and enable other engineering teams by increasing self-service and decreasing cognitive load.

This role is ideal for someone with a strong DevOps mindset, a passion for building and maintaining a mature GitOps environment, and experience focusing on observability. The entire engineering team is expanding, offering opportunities to build, learn, and grow.

We welcome applicants from diverse backgrounds. If you think you would do a great job at Chainlink, we look forward to speaking with you, even if you don\'t match 100% of the job requirements: those describe people we\'ve usually had a great time working with, but they\'re not a tick-box exercise.

Your Impact

Build and orchestrate Modern OTEL-based Observability Platform
Support multiple telemetry types, like metrics, logs and traces
Define and support modern governance in observability and problems at scale
Ensure reliability, security, and performance exceed our defined SLAs
Collaborate with engineers across the company to troubleshoot issues, deploy new products and services, and increase velocity while decreasing cognitive load
Lead the design and deployment of monitoring/observability services to detect and alert the team of needed action
Ingest, aggregate, transform, and utilize data from multiple sources in our real-time data pipeline
Oversee availability, performance, and supportability of our observability infrastructure
Create processes around alert response operations and support the team to ensure reliable delivery of oracle data
Suggest metrics to enable alerts with every new feature release
Champion reliability and security by doing work right the first time

Requirements

7+ years of relevant professional experience in devops, infrastructure, SRE, and/or platform teams
Ability to develop software beyond typical infrastructure requirements and configurations
Experience programming in C, C++, Java, Python, Go, Perl, or Ruby
Expert knowledge in designing, developing, and managing large real-time systems
Experience with monitoring and logging; exporting metrics with Prometheus; Grafana dashboards; and centralized logging solutions like ELK Stack, Splunk, or Grafana Stack
Experience with distributed systems and container orchestration; maintenance or building Kubernetes clusters; deploying new services on Kubernetes
Strong communication skills with comfort in planning meetings and code reviews

Desired Qualifications

Excitement for blockchain, Web 3.0, and decentralized technologies
Experience running infrastructure in the blockchain/web3 space
Ability to scale systems sustainably through automation and evolving systems for reliability and velocity
Experience working remotely in a distributed team
Desire to grow and automate services to reduce toil

Tools and Services