Hugging Face
Datasets: 1,032
Sort: Trending
Active filters: benchmark
tencent/CL-bench • Viewer • Updated 1 day ago • 1.9k • 682 • 78
DAGroup-PKU/RoVid-X • Viewer • Updated 13 days ago • 108 • 4.06k • 48
naplab/AVMeme-Exam • Updated 10 days ago • 74 • 17
DietCoke4671/BlenderBench • Viewer • Updated 15 days ago • 27 • 194 • 22
MiniMaxAI/VIBE • Viewer • Updated Dec 23, 2025 • 200 • 1.34k • 271
Sylvest/LIBERO-plus • Updated Oct 17, 2025 • 649 • 21
MiniMaxAI/OctoCodingBench • Viewer • Updated 25 days ago • 72 • 18k • 262
sylvainHellin/ifc-bench • Viewer • Updated 2 days ago • 47 • 124 • 3
google/simpleqa-verified • Viewer • Updated Sep 22, 2025 • 1k • 3.76k • 32
stepfun-ai/AndroidDaily • Viewer • Updated Dec 19, 2025 • 235 • 175 • 12
baochenfu/MMKU-Bench • Updated 4 days ago • 71 • 2
sarvamai/olmOCR-Bench-English • Viewer • Updated 2 days ago • 1.26k • 23 • 2
sarvamai/tts-general-benchmark • Viewer • Updated 1 day ago • 1.82k • 42 • 2
difraud/difraud • Updated Aug 2, 2024 • 412 • 5
google/FACTS-grounding-public • Viewer • Updated Dec 19, 2024 • 868 • 1.68k • 41
jablonkagroup/ChemBench • Viewer • Updated Dec 19, 2025 • 2.79k • 1.91k • 16
tasl-lab/uniocc • Updated Aug 19, 2025 • 689k • 3
AmazonScience/document-haystack • Updated Aug 4, 2025 • 31.8k • 19
luogu-llm-research/LACPT • Updated Jun 29, 2025 • 56 • 3
OpenSafetyLab/t2i_safety_dataset • Updated Aug 5, 2025 • 352 • 1
zai-org/CC-Bench-trajectories • Viewer • Updated Sep 30, 2025 • 260 • 723 • 90
Fancylalala/AEGIS • Viewer • Updated Oct 8, 2025 • 9.53k • 32 • 3
wcy1122/Long-TTS-Eval • Viewer • Updated Oct 6, 2025 • 1.24k • 264 • 9
etri-vilab/holisafe-bench • Viewer • Updated Nov 16, 2025 • 4.03k • 355 • 9
cais/rli-public-set • Updated Nov 3, 2025 • 528 • 3
IAAR-Shanghai/VAR • Updated Nov 14, 2025 • 22 • 1
bird-of-paradise/muon-distributed-reproducibility • Updated Nov 30, 2025 • 7 • 2
sagecontinuum/FireBench • Viewer • Updated about 24 hours ago • 4.08k • 193 • 1
ModalityDance/Omni-Bench • Viewer • Updated 2 days ago • 800 • 59 • 1
MiniMaxAI/OctoBench • Preview • Updated 23 days ago • 451 • 19