2024

MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark

Yuzhi Wang, X. Ma, G. Zhang, Yuan Ni, A. Chandra, S. Guo, W. Ren, A. Arulraj, X. He, Zhejun Jiang, Tao Li, M. Ku, K. Wang, A. Zhuang, R. Fan, Xiang Yue, Weizhu Chen

citations

Cite Score

47

Citation Graph

Loading graph...

References [0]

Sort:
Filter:

No references match the current filters.

Cited by

3

papers in your library

Cites

0

papers in your library

Notes

Tags

canonBenchmark

Paper Aliases

No aliases