pinned
Running
54
Predict Memory
🧮
Calculate memory usage from model configurations
Large scale distributed AI model training, model parallelisation, low-level GPU acceleration, make GPUs go brrrrr
The Nanotron team focus on sharing open knowledge and developping open-source libraries for efficient distributed training of large-scale AI models.
Some of its contributions are: