Submitted by IlyaGusev 66 PingPong: A Benchmark for Role-Playing Language Models with User Emulation and Multi-Model Evaluation · 1 author 2
Submitted by pkanithi 54 MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications · 10 authors 6
Submitted by akhaliq 20 Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models · 7 authors 2
Submitted by sonta7 20 Gated Slot Attention for Efficient Linear-Time Sequence Modeling · 12 authors 2
Submitted by sandeep123 14 Can Large Language Models Unlock Novel Scientific Research Ideas? · 4 authors 8
Submitted by akhaliq 12 Instant Facial Gaussians Translator for Relightable and Interactable Facial Rendering · 10 authors 4
Submitted by akhaliq 11 VMAS: Video-to-Music Generation via Semantic Alignment in Web Music Videos · 5 authors 2
Submitted by thughost 9 ProteinBench: A Holistic Evaluation of Protein Foundation Models · 10 authors 2
Submitted by benbogin 8 SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories · 8 authors 2
Submitted by akhaliq 8 MVLLaVA: An Intelligent Agent for Unified and Flexible Novel View Synthesis · 5 authors 2