Nanotron Research

community

Activity Feed Request to join this org

AI & ML interests

Large scale distributed AI model training, model parallelisation, low-level GPU acceleration, make GPUs go brrrrr

Recent Activity

nouamanetazi updated a dataset about 21 hours ago

nanotron/ultrascale-playbook-data

nouamanetazi updated a Space about 23 hours ago

nanotron/predict_memory

nouamanetazi new activity 1 day ago

nanotron/ultrascale-playbook:Few Errors

View all activity

Organization Card

Community About org cards

The Nanotron team focus on sharing open knowledge and developping open-source libraries for efficient distributed training of large-scale AI models.

Some of its contributions are:

the Nanotron library
the Picotron library
the Ultrascale-Playbook, a comprehensive book covering all distributed/parallelisation and low-level techniques that can be used to efficiently train models at the largest scales.

spaces 2

pinned

Running

Predict Memory

🧮

Calculate memory usage from model configurations

pinned

Running

2.23k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

models 14

datasets 13

nanotron/ultrascale-playbook-data

Updated about 21 hours ago • 131 • 3

nanotron/minipile_100_samples

Viewer • Updated Jul 10, 2024 • 100 • 63 • 1

nanotron/llama3-1024-passkey-retrieval-eval

Viewer • Updated Jul 4, 2024 • 12.6k • 113

nanotron/llama3-16k-passkey-retrieval-finetuning

Viewer • Updated Jun 20, 2024 • 77.3k • 97

nanotron/llama3-16k-passkey-retrieval-eval

Viewer • Updated Jun 19, 2024 • 712 • 71

nanotron/llama3_needle_16k_finetuning

Viewer • Updated Jun 15, 2024 • 3.57k • 72

nanotron/needle_32k_eval_dataset

Viewer • Updated May 29, 2024 • 1.79k • 68 • 1

nanotron/needle_32k_finetuning_dataset

Viewer • Updated May 16, 2024 • 35.5k • 127

nanotron/needle_in_a_hay_stack_finetuning_dataset

Viewer • Updated May 14, 2024 • 21 • 73

nanotron/needle_in_a_hay_stack_eval_dataset

Viewer • Updated May 14, 2024 • 1 • 82 • 1

AI & ML interests

Recent Activity

Team members 12

spaces 2 Sort: Recently updated

Predict Memory

The Ultra-Scale Playbook

models 14 Sort: Recently updated

datasets 13 Sort: Recently updated

spaces 2

models 14

datasets 13