Submitted by akhaliq 189 OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models · 5 authors 19
Submitted by Myashka 112 The Differences Between Direct Alignment Algorithms are a Blur · 5 authors 1
Submitted by ahmed-masry 36 AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding · 22 authors 2
Submitted by jimi888 29 SafeRAG: Benchmarking Security in Retrieval-Augmented Generation of Large Language Model · 11 authors 5
Submitted by RohitGandikota 25 SliderSpace: Decomposing the Visual Capabilities of Diffusion Models · 6 authors 8
Submitted by xinyan233333 24 DeepRAG: Thinking to Retrieval Step by Step for Large Language Models · 9 authors 2
Submitted by huanqia 24 MM-IQ: Benchmarking Human-Like Abstraction and Reasoning in Multimodal Models · 3 authors 2
Submitted by yiren98 20 MakeAnything: Harnessing Diffusion Transformers for Multi-Domain Procedural Sequence Generation · 3 authors 2
Submitted by akhaliq 17 ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning · 7 authors 2
Submitted by dongwonjo 16 FastKV: KV Cache Compression for Fast Long-Context Processing with Token-Selective Propagation · 4 authors 2
Submitted by akhaliq 14 The Jumping Reasoning Curve? Tracking the Evolution of Reasoning Performance in GPT-[n] and o-[n] Models on Multimodal Puzzles · 4 authors 2
Submitted by hba123 11 Almost Surely Safe Alignment of Large Language Models at Inference-Time · 6 authors 2
Submitted by arjunguha 9 PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models · 8 authors 6
Submitted by PAlbert31 9 RandLoRA: Full-rank parameter-efficient fine-tuning of large models · 6 authors 3
Submitted by akshat57 5 Lifelong Sequential Knowledge Editing without Model Degradation · 6 authors 2
Submitted by Bowen232 4 LongDPO: Unlock Better Long-form Generation Abilities for LLMs via Critique-augmented Stepwise Information · 6 authors 2
Submitted by vshrivas 4 Language Models Prefer What They Know: Relative Confidence Estimation via Confidence Preferences · 3 authors 2
Submitted by moein99 3 A Study on the Performance of U-Net Modifications in Retroperitoneal Tumor Segmentation · 8 authors 3
Submitted by EdwinDdeJong 2 Current Pathology Foundation Models are unrobust to Medical Center Differences · 3 authors 2