📍 Local Job Near You

Generative ai evaluator | $30/hr remote

🏢

Crossing Hurdles

📍 Remote, South-Africa

📍

Location Remote

📅

Posted June 05, 2026

🚗

Commute Local Area

🎯

Local Opportunity Near You!

This job is in your area. Enjoy a short commute and work close to home.

📋
Job Description

Type: Hourly contract 
Compensation: $20–$30/hour 
Location: Remote 
Commitment: 10–40 hours/week 
Role Responsibilities Evaluate outputs from large language models and autonomous agent systems using defined rubrics and quality standards. Review multi-step agent workflows, including screenshots and reasoning traces, to assess accuracy and completeness. Apply benchmarking criteria consistently while identifying edge cases and recurring failure patterns. Provide structured, actionable feedback to support model refinement and product improvements. Participate in calibration sessions to ensure consistent evaluation alignment across reviewers. Adapt to evolving guidelines and ambiguous scenarios with sound judgment. Document findings clearly and communicate insights to relevant stakeholders. 
Requirements Strong experience in LLM evaluation, AI output analysis, QA/testing, UX research, or similar analytical roles. Proficiency in rubric-based scoring, benchmarking frameworks, an...
                

Apply for This Job

Submit Application

Quick and secure application process

📍 Location Details

🌆

City

Remote

🗺️

Country

South-Africa

🚗

Commute

Local Area

🔍 More Jobs Nearby

Explore other opportunities in Remote

View Local Jobs

Generative ai evaluator | $30/hr remote

📋 Job Description

Apply for This Job

📍 Location Details

🔍 More Jobs Nearby

📋
Job Description