LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL Paper • 2503.07536 • Published 4 days ago • 73
Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model Paper • 2503.07703 • Published 4 days ago • 30
Gemini Embedding: Generalizable Embeddings from Gemini Paper • 2503.07891 • Published 4 days ago • 25
Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning Paper • 2503.07572 • Published 4 days ago • 32
Implicit Reasoning in Transformers is Reasoning through Shortcuts Paper • 2503.07604 • Published 4 days ago • 17
Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation Paper • 2503.06594 • Published 5 days ago • 4