Submitted by Daoguang · 30 · Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving · 19 authors · 2
Submitted by Ningyu · 15 · SynWorld: Virtual Scenario Synthesis for Agentic Action Knowledge Refinement · 11 authors · 2
Submitted by yifanzhang114 · 10 · MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models · 9 authors · 4
Submitted by akhaliq · 10 · APIGen-MT: Agentic Pipeline for Multi-Turn Data Generation via Simulated Agent-Human Interplay · 15 authors · 2
Submitted by BestWishYsh · 10 · VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning · 8 authors · 2
Submitted by Zhaorun · 9 · ShieldAgent: Shielding Agents via Verifiable Safety Policy Reasoning · 3 authors · 2
Submitted by akhaliq · 7 · Comprehensive Relighting: Generalizable and Consistent Monocular Human Relighting and Harmonization · 12 authors · 2
Submitted by akhaliq · 6 · HumanDreamer-X: Photorealistic Single-image Human Avatars Reconstruction via Gaussian Restoration · 9 authors · 2
Submitted by yyzqy · 5 · EvMic: Event-based Non-contact sound recovery from effective spatial-temporal modeling · 9 authors · 2
Submitted by alokabhishek · 4 · BEATS: Bias Evaluation and Assessment Test Suite for Large Language Models · 3 authors · 2
Submitted by bmay · 3 · Real-is-Sim: Bridging the Sim-to-Real Gap with a Dynamic Digital Twin for Real-World Robot Policy Evaluation · 7 authors · 2
Submitted by nielsr · 3 · Delineate Anything: Resolution-Agnostic Field Boundary Delineation on Satellite Imagery · 7 authors · 2
Submitted by andito · 3 · Slow-Fast Architecture for Video Multi-Modal Large Language Models · 9 authors · 2
Submitted by ChaosLiao · 1 · SPF-Portrait: Towards Pure Portrait Customization with Semantic Pollution-Free Fine-tuning · 9 authors · 2