Role Overview
Tenstorrent is hiring a mid-level Site Reliability Engineer/ Metal. This is a full-time hybrid role, based in CA. Part of Tenstorrent's Devops hiring. Full responsibilities, required qualifications, and the apply link are listed in the description below.
Resume Keywords to Include
Make sure these keywords appear in your resume to improve ATS scoring
Sign up free to auto-tailor your resume with all these keywords and get a higher ATS score
Job Description
Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists have developed a high performance RISC-V CPU from scratch, and share a passion for AI and a deep desire to build the best AI platform possible. We value collaboration, curiosity, and a commitment to solving hard problems. We are growing our team and looking for contributors of all seniorities.
Tenstorrent is building large-scale AI systems across internal clusters and customer deployments. This role sits at the intersection of site reliability, infrastructure operations, and customer engineering, ensuring our systems are reliable, observable, and production-ready.
This role is hybrid, based out of Toronto, ON; Austin, TX; or Santa Clara, CA.
We welcome candidates at various experience levels for this role. During the interview process, candidates will be assessed for the appropriate level, and offers will align with that level, which may differ from the one in this posting.
Who You Are
- Experienced in site reliability, infrastructure, or systems engineering in distributed environments.
- Strong Linux systems knowledge with the ability to troubleshoot complex multi-layer issues.
- Proficient with observability tools such as Prometheus, Grafana, and alerting systems.
- Comfortable with scripting and automation using Python, Go, or similar languages.
- Solid understanding of networking fundamentals and how systems behave at scale.
What We Need
- Ensure reliability and operational health of Tenstorrent systems across internal and customer environments.
- Troubleshoot complex issues across compute, networking, and software layers.
- Partner with engineering teams and customers to resolve production incidents.
- Design and improve monitoring, observability, and alerting systems.
- Build automation to reduce operational toil and improve system reliability.
What You Will Learn
- How large-scale AI infrastructure is operated across internal clusters and customer deployments.
- How distributed systems behave under real-world production conditions.
- How observability and automation drive reliability at scale.
- How hardware, networking, and software systems interact in AI environments.
- How customer-facing AI infrastructure is deployed, supported, and optimized.
Compensation for all engineers at Tenstorrent ranges from $100k - $500k including base and variable compensation targets. Experience, skills, education, background and location all impact the actual offer made.
Tenstorrent offers a highly competitive compensation package and benefits, and we are an equal opportunity employer.
This offer of employment is contingent upon the applicant being eligible to access U.S. export-controlled technology. Due to U.S. export laws, including those codified in the U.S. Export Administration Regulations (EAR), the Company is required to ensure compliance with these laws when transferring technology to nationals of certain countries (such as EAR Country Groups D:1, E1, and E2). These requirements apply to persons located in the U.S. and all countries outside the U.S. As the position offered will have direct and/or indirect access to information, systems, or technologies subject to these laws, the offer may be contingent upon your citizenship/permanent residency status or ability to obtain prior license approval from the U.S. Commerce Department or applicable federal agency. If employment is not possible due to U.S. export laws, any offer of employment will be rescinded.
Original job Site Reliability Engineer/ Metal posted on GrabJobs ©. To flag any issues with this job please use the Report Job button on GrabJobs.
Frequently Asked Questions
How do I apply for the Site Reliability Engineer/ Metal position at Tenstorrent?
Use the Apply button above to submit your application directly to Tenstorrent. Most applications take less than 5 minutes if your resume and contact details are ready, and you'll be routed to the employer's official application system to finish.
Is the Site Reliability Engineer/ Metal role at Tenstorrent remote or in-office?
This is a hybrid role based in CA. Expect a mix of in-office and remote days, with the specific cadence set by the hiring manager.
What does a Site Reliability Engineer/ Metal at Tenstorrent earn?
Tenstorrent has not disclosed a salary range in this posting. Many employers share specifics later in the interview process; you can also ask during a recruiter screen if compensation transparency is important to you.
When was the Site Reliability Engineer/ Metal role at Tenstorrent posted?
This role was posted on April 12, 2026 (58 days ago). It's still listed as actively hiring; we re-confirm openings against the source system multiple times per day and remove closed roles.
AI-powered job search
Get every job scored to your resume
Upload your resume and get jobs ranked, your resume tailored, and employee contacts found automatically.
Get Started FreeNo credit card to start