Himalayas

Software Engineer - Evaluation Author

Mercor

Remote Worldwide · Full-time · Remote



Work mode

Remote

Job type

Full-time

Experience

3-5 years

Salary

USD 35 - 120 per hour

Job Description

About the job

Mercor connects elite creative and technical talent with leading AI research labs.

Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.

Position: Code-Data Eval Author - Software Engineer

Type: Contract

Compensation: $35–$120/hour

Location: Remote

Commitment: 30+ hours/week

Role Responsibilities

Author non-trivial coding tasks with golden solutions and automated verifiers.

Design rubrics and grade agent trajectories and model outputs.

Improve task and rubric quality through structured review.

Evaluate the accuracy and depth of AI-generated content to strengthen reasoning and rigor in model outputs.

Work independently and asynchronously to meet deadlines while improving AI model performance.

Qualifications

Must-Have

5+ years of software engineering at a real product organization (big tech or venture-backed startup).

Strong code quality, systems design, debugging, and testing discipline.

Clear written communication (you write instructions others follow).

Preferred

Familiarity with AI coding tools and evals.

Interview Process

Short Mercor Technical Screen.

Live Code Review Session.

Domain Expert Interview.

