Information Systems Expert - AI Evaluator

Remote · USA Full-time New today

• *About The Job

*Mercor

connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include

*Benchmark**

,

*General Catalyst**

,

*Peter Thiel**

,

*Adam D'Angelo**

,

*Larry Summers**

, and

*Jack Dorsey**

.

*Position:**

AI Model Evaluation Specialist

*Type:
*Contract
Compensation:
$40–$60/hour
*Commitment:
*20 hours/week
*Role Responsibilities
Write realistic prompts reflecting how professionals and consumers seek domain-specific guidance.
Evaluate AI-generated responses for factual accuracy, regulatory or clinical correctness, and practical usefulness.
Identify fabricated claims, incorrect references, or misleading reasoning across model outputs.
Score and rank multiple model responses using structured rubrics across dimensions.
Provide written justifications with specific evidence for each evaluation.
*Qualifications
*Must-Have
Master’s degree or higher in Computer Science, Information Systems, or a relevant professional field.
Professional experience applying domain expertise in a practitioner or advisory capacity.
Familiarity with industry-specific standards, regulations, or clinical guidelines.
Strong written communication and critical reasoning skills.
*Application Process (Takes 20–30 mins to complete)
Submit your resume to begin.
Complete the Model Response Evaluation assessment.
*Resources & Support**

• For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome

For any help or support, reach out to: [email protected]
PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.*

, Apply tot his job Apply To this Job

Related roles

BDI Evaluator

Remote · USA Full-time

AI Writing Evaluators (Domain Experts) - English Expertise

Remote · USA Full-time

Part-time AI Writing Evaluator (Tier 1)

Remote · USA Full-time

Business Research Evaluator | $30/hr Remote

Remote · USA Full-time

Social Media Evaluator (Ukrainian-United States)

Remote · USA Full-time

Qualified Medical Evaluator (QME) - Pain Medicine Physician - Part Time

Remote · USA Full-time

Regional Vocational Evaluation Specialist

Remote · USA Full-time

Lead Program Evaluator – Title III / Federal Education Grants

Remote · USA Full-time

Spanish Speaking CFTSS OLP Supervisor/Evaluator (Remote)

Remote · USA Full-time

Manufacturing Expert - Quality Evaluator

Remote · USA Full-time

Patient Access Schedule Agent-SHARE Partially Remote 930A-6P

Remote · USA Full-time

Experienced Remote Data Entry Specialist and Research Panelist – Flexible Work Arrangements at arenaflex

Remote · USA Full-time

Experienced Customer Service Representative – Insurance Industry Expert – Work from Home Opportunity

Remote · USA Full-time

Hospital Coding Specialist II- Technology Auditor

Remote · USA Full-time

Remote Customer Experience Advisor – Work From Anywhere, Full-Time or Part-Time!

Remote · USA Full-time

Experienced Customer Service Representative – Remote Opportunity at arenaflex

Remote · USA Full-time

Experienced Part-Time Remote Data Entry Specialist – Amazon E-commerce Operations

Remote · USA Full-time

Experienced Non-Voice Customer Service Agent - Temporary | arenaflex US

Remote · USA Full-time

Account Executive

Remote · USA Full-time

Experienced Customer Support Specialist (Remote) at arenaflex

Remote · USA Full-time