πŸ“ Jobs Near Me
πŸ“

HiringNearMe.work

Local Jobs, Zero Commute

πŸ“ Local Job Near You

Working Student (m/f/d) LLM Agent Evaluation & Benchmarking

🏒
Agile Robots SE
πŸ“ Munich, Germany
πŸ“
Location Munich
πŸ“…
Posted June 03, 2026
πŸš—
Commute Local Area
🎯
Local Opportunity Near You!

This job is in your area. Enjoy a short commute and work close to home.

πŸ“‹
Job Description

About the role

We are looking for a Working Student (m/f/d) LLM Agent Evaluation & Benchmarking. In this role, you will design and build an agent-agnostic benchmarking harness, run comparative evaluations across frontier and local models, and translate findings into prompt, guard, and tool-schema improvements.


Your Responsibilities

  • Harness Development: Design and build an agent-agnostic benchmarking harness that executes versioned task suites against frontier and local models with reproducible, version-controlled runs.
  • Task Suite Design: Define and maintain evaluation task suites that measure task success, grounding accuracy, latency, and cost across the agent portfolio.
  • Model Evaluation: Run period...

Apply for This Job

Submit Application

Quick and secure application process

πŸ“ Location Details

πŸŒ†
City
Munich
πŸ—ΊοΈ
Country
Germany
πŸš—
Commute
Local Area

πŸ” More Jobs Nearby

Explore other opportunities in Munich

View Local Jobs