TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation Paper • 2503.04872 • Published 7 days ago • 14
SIFT: Grounding LLM Reasoning in Contexts via Stickers Paper • 2502.14922 • Published 22 days ago • 30
Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs Paper • 2503.01743 • Published 10 days ago • 72
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated about 21 hours ago • 441k • 1.12k
The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters Space • Running • 2.24k
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning Paper • 2502.06781 • Published Feb 10 • 60
yentinglin/Mistral-Small-24B-Instruct-2501-reasoning Text Generation • Updated 21 days ago • 2.19k • 51