Submitted by akhaliq 44 UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning · 8 authors 3
Submitted by luojunyu 36 Large Language Model Agent: A Survey on Methodology, Applications and Challenges · 26 authors 2
Submitted by ToheartZhang 34 Challenging the Boundaries of Reasoning: An Olympiad-Level Math Benchmark for Large Language Models · 8 authors 4
Submitted by Ziqi 27 VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness · 11 authors 2
Submitted by akhaliq 21 LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis · 13 authors 2
Submitted by ZhiCheng0326 17 ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation · 8 authors 2
Submitted by Xueqing 17 FinAudio: A Benchmark for Audio Large Language Models in Financial Applications · 13 authors 2
Submitted by akhaliq 16 ChatAnyone: Stylized Real-time Portrait Video Generation with Hierarchical Motion Diffusion Model · 6 authors 3
Submitted by Paper99 15 Lumina-Image 2.0: A Unified and Efficient Image Generative Framework · 23 authors 2
Submitted by zwq2018 15 Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks · 13 authors 3
Submitted by ZonglinY 15 ResearchBench: Benchmarking LLMs in Scientific Discovery via Inspiration-Based Task Decomposition · 10 authors 2
Submitted by huangsiteng 9 Exploring the Evolution of Physics Cognition in Video Generation: A Survey · 11 authors 2
Submitted by akhaliq 6 Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields · 11 authors 2
Submitted by dovpie 4 Semantic Library Adaptation: LoRA Retrieval and Fusion for Open-Vocabulary Semantic Segmentation · 11 authors 2
Submitted by imranraad 4 LLPut: Investigating Large Language Models for Bug Report-Based Input Generation · 4 authors 2
Submitted by Trickyjustice 1 LOCATEdit: Graph Laplacian Optimized Cross Attention for Localized Text-Guided Image Editing · 3 authors 2
Submitted by nielsr 1 Tracktention: Leveraging Point Tracking to Attend Videos Faster and Better · 2 authors 2