HF中国镜像站

Quentin Gallouédec's picture

Quentin Gallouédec

qgallouedec

·

AI & ML interests

None yet

Recent Activity

updated a model 35 minutes ago

qgallouedec/gemma-3-4b-it-codeforces-SFT

published a model 36 minutes ago

qgallouedec/gemma-3-4b-it-codeforces-SFT

updated a model about 15 hours ago

qgallouedec/gemma-3-27b-it-codeforces-SFT

View all activity

Organizations

Articles 5

Article

182

Open R1: Update #3

Article

294

Open-R1: Update #1

View all Articles

Papers 4

arxiv:2402.09844

arxiv:2402.03046

arxiv:2208.14928

arxiv:2106.13687

spaces 1

Train Memory

Generate memory usage forecast for deep learning models

models 718

qgallouedec/gemma-3-4b-it-codeforces-SFT

Image-Text-to-Text • Updated 35 minutes ago

qgallouedec/gemma-3-27b-it-codeforces-SFT

Image-Text-to-Text • Updated about 15 hours ago • 1 • 1

qgallouedec/Qwen2.5-0.5B-codeforces-SFT

Text Generation • Updated about 16 hours ago • 3

qgallouedec/Qwen2.5-0.5B-GRPO-main

Text Generation • Updated 22 days ago • 24

qgallouedec/gemma-2-2B-it-thinking-function_calling

Updated 23 days ago

qgallouedec/Qwen2.5-0.5B-GRPO-2873

Updated 24 days ago

qgallouedec/Qwen2.5-0.5B-GRPO-2776-next

Updated 29 days ago

qgallouedec/Qwen2.5-1.5B-Open-R1-GRPO

Text Generation • Updated Feb 8 • 31

qgallouedec/Qwen2.5-32B-Open-R1-GRPO

Updated Feb 6 • 1

qgallouedec/Qwen2.5-14B-Open-R1-GRPO

datasets 67

qgallouedec/trl-metrics

Viewer • Updated 17 days ago • 86.1k • 4.48k • 1

qgallouedec/prm800k

Viewer • Updated Dec 17, 2024 • 41.2k • 160 • 3

qgallouedec/ultrafeedback-prompt

Viewer • Updated Sep 9, 2024 • 60.9k • 62

qgallouedec/ultrafeedback-gpt-3.5-turbo-helpfulness

Viewer • Updated Sep 9, 2024 • 16.6k • 104

qgallouedec/lm-human-preferences-descriptiveness

Viewer • Updated Sep 9, 2024 • 6.26k • 64

qgallouedec/lm-human-preferences-sentiment

Viewer • Updated Sep 9, 2024 • 6.26k • 68

qgallouedec/tldr-preference

Viewer • Updated Sep 9, 2024 • 179k • 84

qgallouedec/tldr

Viewer • Updated Sep 9, 2024 • 130k • 75

qgallouedec/hh-rlhf-helpful-base

Viewer • Updated Sep 5, 2024 • 46.2k • 63

qgallouedec/hh-rlhf-helpful-base-trl-style

Viewer • Updated Sep 5, 2024 • 46.2k • 82