Innovative Site Reliability Engineer AI Compute Infrastructure

Full Time · Mid-level

Conklin, Alberta, CA · Posted 8 weeks ago

Role Overview

Cerebras is hiring a mid-level Innovative Site Reliability Engineer AI Compute Infrastructure. This is a full-time role in Conklin, Alberta, and part of Cerebras's DevOps hiring. Full responsibilities, required qualifications, and the apply link are listed in the description below.

Resume Keywords to Include

Make sure these keywords appear in your resume to improve ATS scoring

Python, Docker, Kubernetes, Linux, Telemetry, Innovative, Site, Reliability

Job Description

Position: Innovative Site Reliability Engineer for AI Compute Infrastructure

Location: Conklin

Step into a vital role as a Site Reliability Engineer to enhance AI inference deployments. Drive operational excellence across rapidly growing datacenter environments while utilizing your advanced technical skills.

This role involves hands-on management of AI clusters and ensuring reliable software deployment strategies within high-capacity infrastructures. You will tackle challenges in telemetry, observability, and the development of automated deployment pipelines, allowing for seamless capacity reallocation and maximized performance across systems. Your contributions will be integral in maintaining and growing our leading-edge technology.

Key Responsibilities:

Operate across diverse datacenter environments experiencing rapid growth
Ensure reliability of AI inference deployments at scale
Develop solutions for telemetry and observability
Advance deployment automation for efficient operations
Collaborate with internal teams on translating requirements

Requirements

2-5 years of experience in high-performance compute operations
Strong automation skills with Python
Experience with Linux systems and command-line tools
Knowledge of Docker and Kubernetes
Familiarity with Prometheus and Grafana for observability

Leverage your expertise in AI compute infrastructure to make significant impacts in deployment efficiency and operational reliability in an innovative environment.


Frequently Asked Questions

How do I apply for the Innovative Site Reliability Engineer AI Compute Infrastructure position at Cerebras?

Use the Apply button above to submit your application directly to Cerebras. Most applications take less than 5 minutes if your resume and contact details are ready, and you'll be routed to the employer's official application system to finish.

Where is the Innovative Site Reliability Engineer AI Compute Infrastructure position at Cerebras located?

This position is based in Conklin, Alberta. Cerebras has not indicated remote or hybrid options for this role, so candidates should plan for on-site work.

What does an Innovative Site Reliability Engineer AI Compute Infrastructure at Cerebras earn?

Cerebras has not disclosed a salary range in this posting. Many employers share specifics later in the interview process; you can also ask during a recruiter screen if compensation transparency is important to you.

When was the Innovative Site Reliability Engineer AI Compute Infrastructure role at Cerebras posted?

This role was posted on April 6, 2026 (58 days ago). It's still listed as actively hiring; we re-confirm openings against the source system multiple times per day and remove closed roles.
