Lewis Tunstall PRO

lewtun

AI & ML interests

LLMs, LLMs, LLMs

Organizations

Hugging Face; AutoNLP; Natural Language Processing with Transformers; BigScience Workshop; Ought; Hugging Face Internal Testing Organization; Testing Benchmarks on the Hub; Hugging Face Course; NLP en ES; GEM benchmark; SetFit; Benchmarks Hosting; GEM benchmark submissions; ALPS test; Evaluation datasets; Deep Learning for Particle Physicists; fast.ai community; DreamBooth Hackathon; trl internal testing; SomosNLP; HF Course Demos; Marsyas (Music Analysis, Retrieval and Synthesis for Audio Signals); ONNXConfig for all; How to teach Hugging Face?; Jet Universe; Evaluation on the Hub; The ML Landscape of Top Taggers; HuggingFaceM4; HF Canonical Model Maintainers; TRL; BigCode; Hugging Face H4; Inference Endpoints; Hugging Face OSS Metrics; BigCode Data; Reading Group; Hugging Face H4 Community; Hugging Face TB Research; Hugging Face Smol Cluster; Open LLM Leaderboard; EPFL LLM Team; H4 Alignment Handbook; ZeroGPU Explorers; h4-argilla-collab; Project-Numina; ORPO Explorers; Kato; Distillation Hugs; Hugging Face Discord Community; Data Agents; nltpt; IOPO Experiments; Hugging Face FineVideo; Reliable Agents; Hugging Face Science; HF CMU Collab; Open R1; todo

Posts 8

Post
Introducing OlympicCoder: a series of open reasoning models that can solve olympiad-level programming problems 🧑‍💻

- 7B open-r1/OlympicCoder-7B
- 32B open-r1/OlympicCoder-32B

We find that OlympicCoder models outperform Claude 3.7 Sonnet, as well as other models over 100x larger 💪

Together with the models, we are releasing:

📊 CodeForces-CoTs: a new dataset of coding problems from the most popular competitive programming platform, with R1 traces in C++ and Python (open-r1/codeforces-cots)

🏆 IOI'2024: a new benchmark of very hard programming problems where even frontier models struggle to match human performance (open-r1/ioi)

For links to the models and datasets, check out our latest progress report from Open R1: https://huggingface.co/blog/open-r1/update-3

Articles 28

Article

Open R1: Update #3