HF中国镜像站

Zonghao Guo's picture

3 4

Zonghao Guo

guozonghao96

AI & ML interests

None yet

Recent Activity

authored a paper 3 days ago

DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding

upvoted a paper 2 months ago

Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models

upvoted a paper 3 months ago

LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer

View all activity

Organizations

None yet

Papers 4

arxiv:2503.12797

arxiv:2503.12303

arxiv:2412.13871

arxiv:2403.11703

models 1

guozonghao96/llava-uhd-144-13b

Text Generation • Updated Jul 30, 2024 • 67 • 1

datasets 2

guozonghao96/ocr_vqa_image

Updated Aug 4, 2024 • 3

guozonghao96/objects365

Updated Jul 9, 2024 • 83