53 32 19

YSH

BestWishYsh

https://shyuanbest.github.io/

AI & ML interests

None yet

Recent Activity

authored a paper about 3 hours ago

CINEMA: Coherent Multi-Subject Video Generation via MLLM-Based Guidance

upvoted a paper about 5 hours ago

Long Context Tuning for Video Generation

upvoted a paper about 9 hours ago

CINEMA: Coherent Multi-Subject Video Generation via MLLM-Based Guidance

View all activity

Organizations

BestWishYsh's activity

authored a paper about 3 hours ago

CINEMA: Coherent Multi-Subject Video Generation via MLLM-Based Guidance

Paper • 2503.10391 • Published about 22 hours ago • 5

upvoted a paper about 5 hours ago

Long Context Tuning for Video Generation

Paper • 2503.10589 • Published about 18 hours ago • 6

upvoted a paper about 9 hours ago

CINEMA: Coherent Multi-Subject Video Generation via MLLM-Based Guidance

Paper • 2503.10391 • Published about 22 hours ago • 5

commented a paper about 9 hours ago

CINEMA: Coherent Multi-Subject Video Generation via MLLM-Based Guidance

Paper • 2503.10391 • Published about 22 hours ago • 5 •

upvoted a paper 3 days ago

VACE: All-in-One Video Creation and Editing

Paper • 2503.07598 • Published 4 days ago • 33

commented a paper 3 days ago

VACE: All-in-One Video Creation and Editing

Paper • 2503.07598 • Published 4 days ago • 33 •

upvoted a paper 3 days ago

WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation

Paper • 2503.07265 • Published 4 days ago • 4

commented a paper 3 days ago

WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation

Paper • 2503.07265 • Published 4 days ago • 4 •

liked a model 10 days ago

Wan-AI/Wan2.1-T2V-14B-Diffusers

Text-to-Video • Updated 10 days ago • 10.2k • 18

upvoted a paper 11 days ago

MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing

Paper • 2502.21291 • Published 14 days ago • 4

commented a paper 11 days ago

MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing

Paper • 2502.21291 • Published 14 days ago • 4 •

upvoted a paper 14 days ago

Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think

Paper • 2502.20172 • Published 15 days ago • 27

commented a paper 14 days ago

Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think

Paper • 2502.20172 • Published 15 days ago • 27 •

upvoted a paper 18 days ago

FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation

Paper • 2502.13995 • Published 23 days ago • 8

commented a paper 18 days ago

FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation

Paper • 2502.13995 • Published 23 days ago • 8 •

updated a Space 18 days ago

ConsisID-preview

🔥

Identity-Preserving Text-to-Video Generation

upvoted a paper 25 days ago

Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model

Paper • 2502.10248 • Published 28 days ago • 51

liked a model 25 days ago

stepfun-ai/stepvideo-t2v

Text-to-Video • Updated 23 days ago • 1.92k • 416

upvoted a paper 28 days ago

VFX Creator: Animated Visual Effect Generation with Controllable Diffusion Transformer

Paper • 2502.05979 • Published Feb 9 • 8

commented a paper 28 days ago

VFX Creator: Animated Visual Effect Generation with Controllable Diffusion Transformer

Paper • 2502.05979 • Published Feb 9 • 8 •