HF中国镜像站
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
3
4
Zonghao Guo
guozonghao96
Follow
AI & ML interests
None yet
Recent Activity
authored
a paper
3 days ago
DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding
upvoted
a
paper
2 months ago
Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models
upvoted
a
paper
3 months ago
LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer
View all activity
Organizations
None yet
Papers
4
arxiv:
2503.12797
arxiv:
2503.12303
arxiv:
2412.13871
arxiv:
2403.11703
models
1
guozonghao96/llava-uhd-144-13b
Text Generation
•
Updated
Jul 30, 2024
•
67
•
1
datasets
2
Sort: Recently updated
guozonghao96/ocr_vqa_image
Updated
Aug 4, 2024
•
3
guozonghao96/objects365
Updated
Jul 9, 2024
•
83