HF中国镜像站

Quentin Gallouédec's picture

Quentin Gallouédec

qgallouedec

·

AI & ML interests

None yet

Recent Activity

updated a model about 4 hours ago

qgallouedec/gemma-3-4b-it-codeforces-SFT

published a model about 4 hours ago

qgallouedec/gemma-3-4b-it-codeforces-SFT

updated a model about 18 hours ago

qgallouedec/gemma-3-27b-it-codeforces-SFT

View all activity

Organizations

qgallouedec's activity

updated a model about 4 hours ago

qgallouedec/gemma-3-4b-it-codeforces-SFT

Image-Text-to-Text • Updated about 4 hours ago

published a model about 4 hours ago

qgallouedec/gemma-3-4b-it-codeforces-SFT

Image-Text-to-Text • Updated about 4 hours ago

updated a model about 18 hours ago

qgallouedec/gemma-3-27b-it-codeforces-SFT

Image-Text-to-Text • Updated about 18 hours ago • 1 • 1

published a model about 18 hours ago

qgallouedec/gemma-3-27b-it-codeforces-SFT

Image-Text-to-Text • Updated about 18 hours ago • 1 • 1

updated a model about 19 hours ago

qgallouedec/Qwen2.5-0.5B-codeforces-SFT

Text Generation • Updated about 19 hours ago • 3

published a model about 19 hours ago

qgallouedec/Qwen2.5-0.5B-codeforces-SFT

Text Generation • Updated about 19 hours ago • 3

liked a model 1 day ago

google/gemma-3-27b-pt

Image-Text-to-Text • Updated 1 day ago • 2.18k • 43

upvoted a collection 1 day ago

Gemma 3 Release

9 items • Updated 1 day ago • 208

reacted to fdaudens's post with 🔥 2 days ago

Post

1611

🔥The Open R1 team just dropped OlympicCoder and it's wild:

- 7B model outperforms Claude 3.7 Sonnet on IOI benchmark (yes, 7B!!)
- 32B crushes all open-weight models tested, even those 100x larger 🤯

Open-sourcing the future of code reasoning! 🚀

Check it out https://huggingface.co/blog/open-r1/update-3

upvoted an article 2 days ago

Article

Open R1: Update #3

By

and 9 others •

2 days ago

• 185

published an article 2 days ago

Article

Open R1: Update #3

By

and 9 others •

2 days ago

• 185

upvoted a paper 2 days ago

Proximal Policy Optimization Algorithms

Paper • 1707.06347 • Published Jul 20, 2017 • 8

upvoted an article 2 days ago

Article

The N Implementation Details of RLHF with PPO

Oct 24, 2023

• 42

commented on Preference Optimization for Vision Language Models 8 days ago

I think it's related to this: https://github.com/huggingface/peft/issues/303

updated a Space 13 days ago

Train Memory

Generate memory usage forecast for deep learning models

liked a model 13 days ago

Qwen/Qwen2.5-0.5B

Text Generation • Updated Sep 25, 2024 • 526k • • 229

upvoted a paper 13 days ago

ZeRO: Memory Optimizations Toward Training Trillion Parameter Models

Paper • 1910.02054 • Published Oct 4, 2019 • 5

updated a dataset 13 days ago

trl-lib/documentation-images

Viewer • Updated 13 days ago • 1 • 164k

upvoted a paper 13 days ago

The Llama 3 Herd of Models

Paper • 2407.21783 • Published Jul 31, 2024 • 114

updated a dataset 17 days ago

qgallouedec/trl-metrics

Viewer • Updated 17 days ago • 86.1k • 4.48k • 1