All roles

Professional Evaluator - Fully Remote | Upto $35/hr Hourly

Remote · USA Full-time New today

About The Job Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey. Position: AI Model Evaluation Contractor Type: Contract Compensation: $25–$35/hour Commitment: 20 hours/week Role Responsibilities

  • Write realistic prompts reflecting professional and consumer domain-specific guidance.
  • Evaluate AI-generated responses for factual accuracy, regulatory correctness, and practical usefulness.
  • Identify fabricated claims, incorrect references, or misleading reasoning in model outputs.
  • Score and rank multiple model responses using structured rubrics across dimensions.
  • Provide written justifications with specific evidence for each evaluation.

Qualifications

Must-Have

  • Professional experience applying domain expertise in a practitioner or advisory capacity.
  • Familiarity with industry-specific standards, regulations, or clinical guidelines.
  • Strong written communication and critical reasoning skills.

Application Process (Takes 20–30 mins to complete)

  • Submit your resume to begin.
  • Complete the Model Response Evaluation assessment.

Resources & Support

  • For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome
  • For any help or support, reach out to: [email protected]

PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity. Apply tot his job Apply To this Job

Related roles

Audio Evaluator - Fully Remote | Upto $50/hr Hourly

Remote · USA Full-time

Special Investigations Unit, Investigator- Remote

Remote · USA Full-time

Healthcare Fraud Investigator - Case Development- Remote

Remote · USA Full-time

Client Service Advisor

Remote · USA Full-time

(US) Customer Success Manager, Senior Living – Remote, USA

Remote · USA Full-time

Logistics Coordinator (Entry Level)

Remote · USA Full-time

Coordinator, Talent

Remote · USA Full-time

Remote | travel logistics coordinator

Remote · USA Full-time

Remote Backend Data Entry Jobs for College Students

Remote · USA Full-time

Remote Customer Onboarding Specialist – Tech Services

Remote · USA Full-time

Senior Salesforce Developer (Certinia/FinancialForce)

Remote · USA Full-time

Experienced Data Entry Specialist – Part-Time Remote Opportunity at arenaflex

Remote · USA Full-time

Mortgage Loan Underwriter

Remote · USA Full-time

GCP Practice Architect II-AI/ML

Remote · USA Full-time

Experienced Remote Healthcare Data Entry Specialist – Entry-Level Opportunity for Career Growth at arenaflex

Remote · USA Full-time

Contact Center Specialist Remote

Remote · USA Full-time

INDEPENDENT CONTRACTOR

Remote · USA Full-time

Experienced Remote Customer Interaction Specialist – Flexible Hours, Competitive Pay, and Unlimited Growth Opportunities at arenaflex

Remote · USA Full-time

Experienced Chat Support Representative – Work from Home with arenaflex

Remote · USA Full-time

Experienced Office Service Clerk / Data Entry Clerk / Customer Service Representative - Fully Remote Opportunity at arenaflex

Remote · USA Full-time