Leading benchmark for evaluating Chinese LLMs across 52 disciplines.

Outputs 2

C-Eval Benchmark

dataset

Leading benchmark for evaluating Chinese LLMs across 52 disciplines.

C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite

paper

Paper introducing the leading benchmark for evaluating Chinese LLMs across 52 disciplines.

arXiv: 2305.08322

benchmarknlpevaluation