Bilingual LLM Evaluator - Expert

Job in Germany, Pike County, Ohio, USA
Listing for: Mercor
Full Time, Part Time position
Listed on 2026-01-13
Job specializations:
  • IT/Tech
    Data Scientist, Data Analyst
Salary/Wage Range or Industry Benchmark: 36.16 USD hourly
Job Description & How to Apply Below
Location: Germany

Base pay range

$36.16/hr - $36.16/hr

About The Job

Mercor connects elite creative and technical talent with leading AI research labs. We are headquartered in San Francisco, and our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.

Position: AI Model Evaluator

Type: Full-time or part-time contract work

Compensation: $36/hour

Location: Restricted to Europe, Canada (Quebec), and the USA

Role Responsibilities
  • Evaluate LLM-generated responses on their ability to effectively answer user queries.
  • Conduct fact-checking using trusted public sources and external tools.
  • Generate high-quality human evaluation data by annotating response strengths, areas for improvement, and factual inaccuracies.
  • Assess reasoning quality, clarity, tone, and completeness of responses.
  • Ensure model responses align with expected conversational behavior and system guidelines.
  • Apply consistent annotations by following clear taxonomies, benchmarks, and detailed evaluation guidelines.
Qualifications
Must-Have
  • Bachelor’s degree
  • Native speaker or ILR 5/primary fluency (C2 on the CEFR scale) in French
  • Significant experience using large language models (LLMs)
  • Excellent writing skills
  • Strong attention to detail
  • Adaptable and comfortable moving across topics, domains, and customer requirements
  • Background or experience in domains requiring structured analytical thinking (e.g., research, policy, analytics, linguistics, engineering)
  • Excellent college-level mathematics skills
Preferred
  • Prior experience with RLHF, model evaluation, or data annotation work
  • Experience writing or editing high-quality written content
  • Experience comparing multiple outputs and making fine-grained qualitative judgments
  • Familiarity with evaluation rubrics, benchmarks, or quality scoring systems
Application Process (Takes 20–30 mins to complete)
  • Upload resume
  • AI interview based on your resume
  • Submit form
Resources & Support
  • For details about the interview process and platform information, please check:
  • For any help or support, reach out to:

PS:
Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.
