Skip to main content
Mercor logo

Automation Systems Engineer

Mercor
Full Timemid
CAPosted March 15, 2026

Job Description

**About The Job**

  • *Mercor**

connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include

  • *Benchmark**

,

  • *General Catalyst**

,

  • *Peter Thiel**

,

  • *Adam D'Angelo**

,

  • *Larry Summers**

, and

  • *Jack Dorsey**

.

  • *Position:**

SWE Expert

  • *Type:**
  • *Contract**
  • *Compensation:**
  • *$70–$150/hour**
  • *Role Responsibilities**
  • Convert high-level objectives into tightly scoped, testable deliverables with clear inputs/outputs and measurable success criteria.
  • Create structured documentation defining expected behavior, constraints, and edge cases for reuse by other evaluators.
  • Build lightweight automation scripts to support evaluation flows, such as generating required artifacts and validating outputs.
  • Write deterministic Python verifier scripts for completion checks via final state or output validation.
  • Design prompts/tasks to reliably elicit target workflow behavior while avoiding leakage of internal instructions.
  • Implement robust error handling and actionable failure messages in verification tooling.
  • Develop plausible but ineffective “baseline” or “distractor” approaches to confirm evaluation discrimination.
  • Maintain clean artifact hygiene with versionable structure, consistent naming, and reproducible execution.
  • *Qualifications**
  • *Must-Have**
  • Strong Python skills in file system operations, parsing, validation, and deterministic execution.
  • Experience with evaluation harnesses, automated grading, or QA-style verification.
  • Familiarity with prompt design and LLM evaluation methodologies.
  • Comfort with structured specs and documentation conventions like Markdown and YAML.
  • Working knowledge of Git, CLI workflows, virtual environments, and dependency management.
  • *Preferred**
  • Knowledge of embeddings/similarity concepts like cosine similarity for negative-control design.
  • Ability to communicate clearly and control scope without relying on domain-specific context.
  • *Application Process (Takes 20–30 mins to complete)**
  • Upload resume
  • AI interview based on your resume
  • Submit form
  • *Resources & Support**
  • For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome
  • For any help or support, reach out to: support@mercor.com
  • PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.*

,

Want AI-powered job matching?

Upload your resume and get every job scored, your resume tailored, and hiring manager emails found - automatically.

Get Started Free