Location
Durham
Posted
June 10, 2026
Commute
Local Area
Local Opportunity Near You!
This job is in your area. Enjoy a short commute and work close to home.
Job Description
**Role Overview**
We are seeking a **Senior AI Scientist** to lead the design, development, and operationalization of evaluation frameworks for Generative AI systems, with a primary focus on Large Language Models (LLMs) and agentic AI solutions.
This role will be responsible for defining and implementing robust methods to assess quality, safety, reliability, and business impact across LLM-powered applications and multi-agent workflows. The position operates within regulated environments such as life sciences, clinical research, and regulatory domains, ensuring that AI systems meet enterprise and compliance standards.
**Key Responsibilities**
**1. LLM Evaluation & Benchmarking**
+ Design and implement scalable evaluation frameworks for LLMs across use cases including:
+ Question answering, summarization, information extraction, and reasoning
+ Clinical and regulatory document generation (e.g., ICFs, CSRs, protocols)
+ Develop both ...
We are seeking a **Senior AI Scientist** to lead the design, development, and operationalization of evaluation frameworks for Generative AI systems, with a primary focus on Large Language Models (LLMs) and agentic AI solutions.
This role will be responsible for defining and implementing robust methods to assess quality, safety, reliability, and business impact across LLM-powered applications and multi-agent workflows. The position operates within regulated environments such as life sciences, clinical research, and regulatory domains, ensuring that AI systems meet enterprise and compliance standards.
**Key Responsibilities**
**1. LLM Evaluation & Benchmarking**
+ Design and implement scalable evaluation frameworks for LLMs across use cases including:
+ Question answering, summarization, information extraction, and reasoning
+ Clinical and regulatory document generation (e.g., ICFs, CSRs, protocols)
+ Develop both ...