🔥 The Open R1 team just dropped OlympicCoder and it's wild:
- 7B model outperforms Claude 3.7 Sonnet on the IOI benchmark (yes, 7B!!)
- 32B crushes all open-weight models tested, even those 100x larger 🤯

Open-sourcing the future of code reasoning! 🚀

Check it out: https://huggingface.co/blog/open-r1/update-3
ZeRO: Memory Optimizations Toward Training Trillion Parameter Models (Paper 1910.02054, published Oct 4, 2019)