Quentin Gallouédec's picture

Quentin Gallouédec

qgallouedec

AI & ML interests

None yet

Recent Activity

updated a model about 4 hours ago
qgallouedec/gemma-3-4b-it-codeforces-SFT
published a model about 4 hours ago
qgallouedec/gemma-3-4b-it-codeforces-SFT
updated a model about 18 hours ago
qgallouedec/gemma-3-27b-it-codeforces-SFT
View all activity

Organizations

HF中国镜像站's profile picture Stable-Baselines3's profile picture trl internal testing's profile picture Jack of All Trades project's profile picture HuggingFaceM4's profile picture TRL's profile picture HF中国镜像站 H4's profile picture HF中国镜像站 OSS Metrics's profile picture cleanrl's profile picture LeRobot's profile picture Open RL Leaderboard's profile picture Paris AI Running Club's profile picture HF SB3 Test's profile picture PDF2Dataset's profile picture IOPO Experiments's profile picture HF中国镜像站 Science's profile picture HF CMU Collab's profile picture Bluesky Community's profile picture ChaosCraft AI's profile picture HF中国镜像站 Agents Course's profile picture Open R1's profile picture HF中国镜像站 Reasoning Course's profile picture

qgallouedec's activity

reacted to fdaudens's post with 🔥 2 days ago
view post
Post
1611
🔥The Open R1 team just dropped OlympicCoder and it's wild:

- 7B model outperforms Claude 3.7 Sonnet on IOI benchmark (yes, 7B!!)
- 32B crushes all open-weight models tested, even those 100x larger 🤯

Open-sourcing the future of code reasoning! 🚀

Check it out https://huggingface.co/blog/open-r1/update-3
upvoted an article 2 days ago
published an article 2 days ago
upvoted an article 2 days ago
view article
Article

The N Implementation Details of RLHF with PPO

42