Manufacturing Expert - Quality Evaluator

Remote · USA Full-time New today

• *About The Job

*Mercor

connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include

*Benchmark**

,

*General Catalyst**

,

*Peter Thiel**

,

*Adam D'Angelo**

,

*Larry Summers**

, and

*Jack Dorsey**

.

*Position:**

AI Model Evaluation Specialist

*Type:
*Contract
Compensation:
$25–$35/hour
*Commitment:
*20 hours/week
*Role Responsibilities
Write realistic prompts reflecting professional and consumer domain-specific guidance.
Evaluate AI-generated responses for factual accuracy and practical usefulness.
Identify fabricated claims and misleading reasoning in model outputs.
Score and rank model responses using structured rubrics.
Provide written justifications with specific evidence for evaluations.
*Qualifications
*Must-Have
Professional experience applying domain expertise in a practitioner or advisory capacity.
Familiarity with industry-specific standards, regulations, or clinical guidelines.
Strong written communication and critical reasoning skills.
*Application Process (Takes 20–30 mins to complete)
Submit your resume to begin.
Complete the Model Response Evaluation assessment.
*Resources & Support**

• For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome

For any help or support, reach out to: [email protected]
PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.*

, Apply tot his job Apply To this Job

Related roles

Senior Product Owner, IaaS (Remote)

Remote · USA Full-time

Staff Product Owner (Oracle Retail)

Remote · USA Full-time

Educational Technology AI Rater & Evaluator

Remote · USA Full-time

Vocational Evaluator

Remote · USA Full-time

AI Decision & Response Analyst

Remote · USA Full-time

NURSE EVALUATOR III, HEALTH SERVICES

Remote · USA Full-time

Finance Model Prompt Evaluator

Remote · USA Full-time

AI Quality Evaluator (Polish)

Remote · USA Full-time

Healthcare Research Evaluator (STEM) | $30/hr Remote

Remote · USA Full-time

Generative AI Evaluator (Russian) | $15/hr Remote

Remote · USA Full-time

Work From Home Data Annotation Operator (AI Training)

Remote · USA Full-time

Associate Director, Clinical Trial Delivery Unit FSP Oversight Manager

Remote · USA Full-time

Experienced Customer Service Associate – Delivering Exceptional Experiences in Largo, FL at arenaflex

Remote · USA Full-time

Senior Process Technology Engineer

Remote · USA Full-time

Head of Data

Remote · USA Full-time

Experienced Full Stack Live Chat Agent – Customer Service & Sales Support

Remote · USA Full-time

Staff Machine Learning Engineer, AI Research

Remote · USA Full-time

Experienced Customer Service Representative – Delivering Exceptional Arenaflex Experiences

Remote · USA Full-time

Experienced Junior Operator Data Entry Specialist – Remote Opportunity with arenaflex

Remote · USA Full-time

Experienced Part-Time Remote Data Entry Clerk – National & Local Paid Focus Groups, Clinical Trials, and Market Research Studies

Remote · USA Full-time