Skip to main content
LLM-as-a-Judge: On the Reliability of LLM Evaluations

LLM-as-a-Judge: On the Reliability of LLM Evaluations

Liang et al.

00
2023-03-30
evaluationllm

Abstract

This paper introduces and evaluates the idea described in “LLM-as-a-Judge: On the Reliability of LLM Evaluations”, and reports empirical results that helped shape subsequent work in evaluation, llm.