HF中国镜像站
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
2140.8
TFLOPS
1206
65
55
Quentin Gallouédec
qgallouedec
Follow
drukpa's profile picture
reos156's profile picture
vinhnx90's profile picture
181 followers
·
78 following
QGallouedec
qgallouedec
qgallouedec
qgallouedec.bsky.social
AI & ML interests
None yet
Recent Activity
updated
a model
35 minutes ago
qgallouedec/gemma-3-4b-it-codeforces-SFT
published
a model
36 minutes ago
qgallouedec/gemma-3-4b-it-codeforces-SFT
updated
a model
about 15 hours ago
qgallouedec/gemma-3-27b-it-codeforces-SFT
View all activity
Organizations
Articles
5
Article
182
Open R1: Update #3
Article
294
Open-R1: Update #1
View all Articles
Papers
4
arxiv:
2402.09844
arxiv:
2402.03046
arxiv:
2208.14928
arxiv:
2106.13687
spaces
1
Running
9
Train Memory
📈
Generate memory usage forecast for deep learning models
models
718
Sort: Recently updated
qgallouedec/gemma-3-4b-it-codeforces-SFT
Image-Text-to-Text
•
Updated
35 minutes ago
qgallouedec/gemma-3-27b-it-codeforces-SFT
Image-Text-to-Text
•
Updated
about 15 hours ago
•
1
•
1
qgallouedec/Qwen2.5-0.5B-codeforces-SFT
Text Generation
•
Updated
about 16 hours ago
•
3
qgallouedec/Qwen2.5-0.5B-GRPO-main
Text Generation
•
Updated
22 days ago
•
24
qgallouedec/gemma-2-2B-it-thinking-function_calling
Updated
23 days ago
qgallouedec/Qwen2.5-0.5B-GRPO-2873
Updated
24 days ago
qgallouedec/Qwen2.5-0.5B-GRPO-2776-next
Updated
29 days ago
qgallouedec/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
Updated
Feb 8
•
31
qgallouedec/Qwen2.5-32B-Open-R1-GRPO
Updated
Feb 6
•
1
qgallouedec/Qwen2.5-14B-Open-R1-GRPO
Updated
Feb 6
Expand 718 models
datasets
67
Sort: Recently updated
qgallouedec/trl-metrics
Viewer
•
Updated
17 days ago
•
86.1k
•
4.48k
•
1
qgallouedec/prm800k
Viewer
•
Updated
Dec 17, 2024
•
41.2k
•
160
•
3
qgallouedec/ultrafeedback-prompt
Viewer
•
Updated
Sep 9, 2024
•
60.9k
•
62
qgallouedec/ultrafeedback-gpt-3.5-turbo-helpfulness
Viewer
•
Updated
Sep 9, 2024
•
16.6k
•
104
qgallouedec/lm-human-preferences-descriptiveness
Viewer
•
Updated
Sep 9, 2024
•
6.26k
•
64
qgallouedec/lm-human-preferences-sentiment
Viewer
•
Updated
Sep 9, 2024
•
6.26k
•
68
qgallouedec/tldr-preference
Viewer
•
Updated
Sep 9, 2024
•
179k
•
84
qgallouedec/tldr
Viewer
•
Updated
Sep 9, 2024
•
130k
•
75
qgallouedec/hh-rlhf-helpful-base
Viewer
•
Updated
Sep 5, 2024
•
46.2k
•
63
qgallouedec/hh-rlhf-helpful-base-trl-style
Viewer
•
Updated
Sep 5, 2024
•
46.2k
•
82
Expand 67 datasets