MorphoBench
paperAdaptive difficulty reasoning benchmark with 1,300+ test questions. Validated against advanced reasoning models including o3 and GPT-5. Designed to dynamically adjust question difficulty to expose reasoning capability gaps that fixed benchmarks miss.
Open-sourced via OpenDCAI organization, one of the first major benchmark contributions from ZGCA.