Submitted by tjpxiaoming 98 Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems · 47 authors 2
Submitted by KennyUTC 55 Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing · 10 authors 1
Submitted by BestWishYsh 34 GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation · 10 authors 1
Submitted by ManTle 24 Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme · 5 authors 2
Submitted by gallilmaimon 20 Scaling Analysis of Interleaved Speech-Text Language Models · 4 authors 1
Submitted by scofield7419 17 JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization · 11 authors 1
Submitted by yuanqianhao 16 ShortV: Efficient Multimodal Large Language Models by Freezing Visual Tokens in Ineffective Layers · 9 authors 1
Submitted by danxuhk 13 Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation · 8 authors 2
Submitted by Franck-Dernoncourt 9 Efficient Model Selection for Time Series Forecasting via LLMs · 7 authors 1
Submitted by universea 9 Scaling Laws in Scientific Discovery with AI and Robot Scientists · 10 authors 1
Submitted by tuphs 7 Interpreting Emergent Planning in Model-Free Reinforcement Learning · 5 authors 1
Submitted by shyamgopal 5 Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models · 5 authors 1
Submitted by RyanLiu112 5 GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning · 11 authors 1
Submitted by Falcary 5 NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representations · 9 authors 1
Submitted by bedio 4 Instruction-Guided Autoregressive Neural Network Parameter Generation · 4 authors 1
Submitted by zuazo 3 Whisper-LM: Improving ASR Models with Language Models for Low-Resource Languages · 4 authors 2
Submitted by smajumdar94 - OpenCodeReasoning: Advancing Data Distillation for Competitive Coding · 8 authors 1