Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Datasets:
thuml
/
VisWorld-Eval
like
2
Follow
THUML @ Tsinghua University
122
Tasks:
Any-to-Any
Modalities:
Image
Text
Formats:
parquet
optimized-parquet
Languages:
English
Size:
1K - 10K
ArXiv:
arxiv:
2601.19834
Tags:
multimodal
reasoning
world-models
Libraries:
Datasets
pandas
Polars
+ 1
Dataset card
Data Studio
Files
Files and versions
xet
Community
2
main
VisWorld-Eval
1.4 GB
2 contributors
History:
8 commits
manchery
nielsr
HF Staff
Add any-to-any task category and language tags (
#2
)
36548b3
6 days ago
assets
Upload benchmark.png
9 days ago
ballgame
Upload 7 files
9 days ago
cube
Upload 7 files
9 days ago
maze
Upload 7 files
9 days ago
mmsi
Upload 7 files
9 days ago
multihop
Upload 7 files
9 days ago
paperfolding
Upload 7 files
9 days ago
sokoban
Upload 7 files
9 days ago
.gitattributes
2.46 kB
initial commit
12 days ago
README.md
5.79 kB
Add any-to-any task category and language tags (#2)
6 days ago