
LLM-as-a-Judge: On the Reliability of LLM Evaluations
Liang et al.
2023-03-30
evaluation, llm
Abstract
This paper introduces and evaluates the idea described in “LLM-as-a-Judge: On the Reliability of LLM Evaluations” and reports empirical results that helped shape subsequent work on evaluation and LLMs.