Submitted by akhaliq 37 Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search · 11 authors 2
Submitted by akhaliq 17 Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models · 5 authors 2
Submitted by spapi 8 How "Real" is Your Real-Time Simultaneous Speech-to-Text Translation System? · 4 authors 2
Submitted by pranamanam 4 PepTune: De Novo Generation of Therapeutic Peptides with Multi-Objective-Guided Discrete Diffusion · 3 authors 2