C-Eval
dataset paperLeading benchmark for evaluating Chinese LLMs across 52 disciplines.
Outputs 2
C-Eval Benchmark
datasetLeading benchmark for evaluating Chinese LLMs across 52 disciplines.
C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite
paperPaper introducing the leading benchmark for evaluating Chinese LLMs across 52 disciplines.
arXiv: 2305.08322