Skip to main content
Cerebras logo

Dynamic Deployment Engineer for Machine Learning Inference Clusters

Cerebras
Full Timemid
CAPosted April 9, 2026

Resume Keywords to Include

Make sure these keywords appear in your resume to improve ATS scoring

PythonDockerKubernetesLinuxAgile

Sign up free to auto-tailor your resume with all these keywords and get a higher ATS score

Job Description

Become a Deployment Engineer focused on revolutionizing AI inference capabilities. Enhance deployment reliability and operational efficiency within sophisticated AI compute infrastructures.

In this essential role, you will lead the deployment of AI inference replicas and optimize software rollout across various global datacenters. Utilizing your systems engineering and operational skills, you will develop advanced telemetry solutions and automated pipelines, playing a key part in capacity management. Your work will bridge technical requirements with internal teams to ensure seamless operations.

Key Responsibilities:

  • Deploy and manage AI inference software across multiple datacenters
  • Operate in rapidly growing heterogeneous environments
  • Optimize capacity allocation and replica positioning
  • Enhance telemetry and observability frameworks
  • Build automated deployment pipelines for agile operations

Requirements

  • 2-5 years in on-prem compute infrastructure
  • Expertise in Python for tooling and automation
  • Proficient in Linux and command-line utilities
  • Experience with Docker containers and Kubernetes
  • Familiarity with telemetry tools like InfluxDB and Grafana

Drive innovation in AI model deployment with your technical expertise and contribute to groundbreaking advancements in the AI landscape.

#J-18808-Ljbffr

Want AI-powered job matching?

Upload your resume and get every job scored, your resume tailored, and hiring manager emails found - automatically.

Get Started Free