Zhang Yuanhan
ZhangYuanhan
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
7 minutes ago
BIMBA: Selective-Scan Compression for Long-Range Video Question
Answering
updated
a dataset
21 minutes ago
lmms-lab/haha
updated
a collection
about 18 hours ago
Vision Language General
Organizations
Collections
2
Vision Language General
-
MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks
Paper • 2410.10563 • Published • 38 -
Latent Action Pretraining from Videos
Paper • 2410.11758 • Published • 2 -
TVBench: Redesigning Video-Language Evaluation
Paper • 2410.07752 • Published • 6 -
Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation
Paper • 2501.03225 • Published • 7