HF中国镜像站

Lewis Tunstall's picture

Lewis Tunstall PRO

lewtun

·

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Recent Activity

updated a model 26 minutes ago

open-r1/OlympicCoder-32B

updated a model 29 minutes ago

open-r1/OlympicCoder-7B

new activity 41 minutes ago

open-r1/codeforces-cots:Why is there a discrepancy between the 'Solutions' subset and the 'Solutions_py' subset?

View all activity

Organizations

Posts 8

Post

1677

Introducing OlympicCoder: a series of open reasoning models that can solve olympiad-level programming problems 🧑‍💻

- 7B open-r1/OlympicCoder-7B
- 32B open-r1/OlympicCoder-32B

We find that OlympicCoder models outperform Claude 3.7 Sonnet, as well as others over 100x larger 💪

Together with the models, we are releasing:

📊CodeForces-CoTs: new dataset of code problems from the most popular competitive coding platform, with R1 traces in C++ and Python open-r1/codeforces-cots

🏆 IOI'2024: a new benchmark of VERY hard programming problems where even frontier models struggle to match human performance open-r1/ioi

For links to the models and datasets, check out our latest progress report from Open R1: https://huggingface.co/blog/open-r1/update-3

Articles 28

Article

182

Open R1: Update #3

View all Articles

Collections 4

Papers 9

arxiv:2503.07572

arxiv:2502.02737

arxiv:2310.16944

arxiv:2303.12582

spaces 20

Chuck Norris Jokes

Fetch a random Chuck Norris joke

OpenGPT

Explain physics concepts like Feynman

Donut Docvqa

Argilla Space Template

No application file

Chip

Dreambooth Training

models 271

lewtun/Qwen2.5-7B-Instruct-GRPO

Updated 6 days ago

lewtun/Qwen2.5-Math-1.5B-Instruct-GRPO

Updated 7 days ago

lewtun/dummy-config-test

Text Generation • Updated 21 days ago • 10

lewtun/Qwen2.5-1.5B-Open-R1-Code-GRPO

Updated 23 days ago

lewtun/Qwen2.5-1.5B-Open-R1-Distill

Updated 23 days ago • 54

lewtun/smollm2-distill-default-chat-template

Text Generation • Updated 24 days ago • 80

lewtun/qwen2.5-1.5b-distill-default-chat-template

Updated 24 days ago • 17

lewtun/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

Updated Feb 7 • 23

lewtun/Qwen-2.5-7B-Simple-RL

lewtun/DeepSeek-R1-Distill-Qwen-7B-GRPO

datasets 71

lewtun/details_Qwen__Qwen2.5-Coder-3B-Instruct

Viewer • Updated 21 days ago • 33 • 176

lewtun/details_deepseek-ai__DeepSeek-R1-Distill-Qwen-1.5B

Viewer • Updated Feb 6 • 1k • 140

lewtun/details_open-thoughts__OpenThinker-7B

Viewer • Updated Feb 5 • 597 • 82

lewtun/details_deepseek-ai__DeepSeek-R1-Distill-Qwen-7B

Viewer • Updated Feb 5 • 597 • 154

lewtun/details_meta-llama__Llama-3.2-3B-Instruct

Viewer • Updated Feb 5 • 1.74k • 86

lewtun/details_deepseek-ai__DeepSeek-R1-Distill-Llama-8B

Viewer • Updated Feb 5 • 598 • 229

lewtun/details_meta-llama__Llama-3.1-8B-Instruct

Viewer • Updated Feb 5 • 597 • 95

lewtun/details_Qwen__Qwen2.5-1.5B-Instruct

Viewer • Updated Feb 5 • 2.25k • 137

lewtun/details_Qwen__Qwen2.5-0.5B-Instruct

Viewer • Updated Feb 5 • 898 • 68

lewtun/details_meta-llama__Llama-3.2-1B-Instruct

Viewer • Updated Feb 5 • 898 • 61