HF中国镜像站

Leandro von Werra's picture

Leandro von Werra

lvwerra

·

https://github.com/lvwerra

AI & ML interests

NLP and RL

Recent Activity

updated a dataset about 24 hours ago

lvwerra/admin

upvoted an article 1 day ago

Open R1: Update #3

published an article 1 day ago

Open R1: Update #3

View all activity

Organizations

lvwerra's activity

liked a Space 7 days ago

QwQ 32B Demo

Generate text responses to user prompts

liked a Space 17 days ago

Open LLM Progress Tracker

Visualize Open vs. Proprietary LLM Progress

liked a Space 22 days ago

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

liked a Space about 1 month ago

DABstep Leaderboard

DABstep Reasoning Benchmark Leaderboard

liked a model about 2 months ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 17 days ago • 2.75M • • 11.3k

liked 2 Spaces 3 months ago

Jupyter Agent

Create and run Jupyter notebooks interactively

Scaling test-time compute

Enhance math problem solving by scaling test-time compute

liked a dataset 3 months ago

microsoft/RedStone

Updated Dec 5, 2024 • 226 • 33

liked a dataset 4 months ago

ylecun/mnist

Viewer • Updated Aug 8, 2024 • 70k • 29.4k • 159

liked a Space 4 months ago

Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks

Evaluate multilingual models using FineTasks

liked a model 4 months ago

HuggingFaceTB/SmolLM2-1.7B-Instruct

Text Generation • Updated 7 days ago • 604k • • 573

liked 2 Spaces 5 months ago

CinePileLeaderboard

Video-LLM evaluations on CinePile's evaluation split.

TxT360: Trillion Extracted Text

Create a large, deduplicated dataset for LLM pre-training

liked a dataset 6 months ago

HuggingFaceFV/finevideo

Viewer • Updated Dec 16, 2024 • 39.5k • 8.87k • 302

liked 2 models 7 months ago

meta-llama/Llama-3.1-8B-Instruct

Text Generation • Updated Sep 25, 2024 • 6.18M • • 3.74k

google/gemma-2-2b

Text Generation • Updated Aug 7, 2024 • 261k • 527

liked 2 Spaces 9 months ago

BigCodeBench Leaderboard

Explore and analyze code evaluation data

FineWeb: decanting the web for the finest text data at scale

Generate high-quality web text data for LLM training

liked a dataset 10 months ago

tomg-group-umd/cinepile

Viewer • Updated Oct 23, 2024 • 608k • 252 • 79

liked a model 11 months ago

bigcode/starcoder2-15b-instruct-v0.1

Text Generation • Updated Nov 3, 2024 • 1.01k • 101