

Anton Lozhkov

anton-l


AI & ML interests

Generative Models, Distributed Training, Photo and Video Enhancement

Recent Activity

liked a model 1 day ago

HuggingFaceTB/SmolLM2-1.7B-Instruct-16k

published an article 2 days ago

Open R1: Update #3

liked a dataset 6 days ago

HuggingFaceTB/dclm-edu


Organizations

Posts 1


Introducing 📐𝐅𝐢𝐧𝐞𝐌𝐚𝐭𝐡: the best public math pre-training dataset with 50B+ tokens!
HuggingFaceTB/finemath

Math remains challenging for LLMs, and by training on FineMath we see considerable gains over other math datasets, especially on GSM8K and MATH.

We build the dataset by:
🛠️ carefully extracting math data from Common Crawl;
🔎 iteratively filtering and recalling high-quality math pages using a classifier trained on synthetic annotations to identify math reasoning and deduction.
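
To make the filtering step concrete, here is a minimal sketch of threshold-based page filtering. It is not the FineMath pipeline: `toy_math_score` is a hypothetical keyword heuristic standing in for the trained classifier, and the 0 to 5 score range and the `threshold=3.0` cutoff are assumptions chosen for illustration. The iterative recall step is not modeled.

```python
from typing import Callable, Iterable, List


def toy_math_score(text: str) -> float:
    """Hypothetical stand-in for a learned quality classifier.

    Counts simple math-related markers and caps the score at 5.0.
    A real pipeline would call a trained model here instead.
    """
    markers = ("=", "+", "theorem", "proof", "solve", "therefore")
    lowered = text.lower()
    hits = sum(1 for marker in markers if marker in lowered)
    return min(5.0, float(hits))


def filter_pages(
    pages: Iterable[str],
    score_fn: Callable[[str], float],
    threshold: float = 3.0,  # assumed cutoff, not taken from the post
) -> List[str]:
    """Keep only the pages whose score reaches the threshold."""
    return [page for page in pages if score_fn(page) >= threshold]


if __name__ == "__main__":
    sample_pages = [
        "Solve x + 2 = 5, therefore x = 3.",
        "Buy cheap shoes today",
    ]
    for page in filter_pages(sample_pages, toy_math_score):
        print(page)
```

Passing the scorer in as `score_fn` keeps the thresholding logic independent of the model, so a real classifier could be swapped in without touching `filter_pages`.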

We conducted a series of ablations comparing the performance of Llama-3.2-3B-Base after continued pre-training on FineMath and observed notable gains compared to the baseline model and other public math datasets.

We hope this helps advance the performance of LLMs on math and reasoning! 🚀
We’re also releasing all the ablation models as well as the evaluation code.

HuggingFaceTB/finemath-6763fb8f71b6439b653482c2

Articles 6


Open R1: Update #3


Papers 5

arxiv:2502.02737

arxiv:2406.17557

arxiv:2402.19173

arxiv:2306.16527

Spaces 3

Kinda-English ruDALL-E

Html Parser Viz

YouTube Streaming ASR

Models 70

anton-l/bert_snowflake_regression

Updated May 6, 2024

anton-l/ddpm-butterflies-128

Updated Aug 3, 2023 • 413 • 9

anton-l/ddpm-butterflies-128-test

Updated Jan 11, 2023

anton-l/dream-sna2

Text-to-Image • Updated Jan 8, 2023 • 5

anton-l/dream-sna

Text-to-Image • Updated Jan 8, 2023 • 7

anton-l/ddpm-ema-flowers-64-2gpu

Updated Jan 5, 2023 • 15

anton-l/ddpm-ema-flowers-64-testt

Updated Dec 19, 2022 • 7

anton-l/wav2vec2-base-superb-sv

Audio Classification • Updated Nov 11, 2022 • 943 • 3

anton-l/ddpm-ema-flowers-64-test

Updated Oct 27, 2022 • 5

anton-l/gpt-j-tiny-random

Text Generation • Updated Oct 24, 2022 • 2.12k • 1

Datasets 24

anton-l/superb_dummy

Updated Sep 10, 2024 • 1.82k

anton-l/superb

Updated Sep 10, 2024 • 86 • 1

anton-l/superb_demo

Updated Aug 14, 2024 • 2.54k • 1

anton-l/fw_edu_200k_3_clusters

Viewer • Updated Aug 11, 2024 • 100k • 103

anton-l/dclm_edu_200k_clusters

Viewer • Updated Aug 11, 2024 • 100k • 98

anton-l/stanford_prompts_1M_rag

Viewer • Updated Mar 28, 2024 • 50k • 38 • 2

anton-l/math_fw_sample

Viewer • Updated Mar 21, 2024 • 48

anton-l/wiki-embed-mxbai-embed-large-v1

Viewer • Updated Mar 19, 2024 • 19.4M • 471

anton-l/wiki-chunked-mxbai-embed-large-v1

Viewer • Updated Mar 18, 2024 • 2.64M • 410

anton-l/wiki_embeddings

Viewer • Updated Mar 15, 2024 • 58.7k • 59