Location
Remote
Posted
June 19, 2026
Commute
Local Area
Local Opportunity Near You!
This job is in your area. Enjoy a short commute and work close to home.
Job Description
- Evaluate LLM-generated responses for accuracy, relevance, and effectiveness across a wide range of topics.
- Perform fact-checking using reliable public sources and external tools.
- Create high-quality human evaluation data by annotating response strengths, gaps, and factual errors.
- Assess reasoning quality, tone, clarity, and completeness of AI-generated outputs.
- Ensure responses follow expected conversational behavior and system guidelines.
- Apply consistent annotations using defined taxonomies, benchmarks, and evaluation frameworks.
Requirements
- Native-level or near-native fluency in French (ILR 5 / CEFR C2) with strong English proficiency.
- Proven experience using large language models and understanding real-world LLM use cases.
- Excellent writing skills with the ability to provide clear, nuanced, and structured feedback.
- Strong attention to detail and analytical thinking. ...