Foundation Text-Generation Models Below 360M Parameters Collection Great candidates for fine-tuning targeting Wllama and Transformers.js for mobile devices, ordered by number of parameters. β’ 35 items β’ Updated 11 days ago β’ 31
Trained Models ποΈ Collection They may be small, but they're training like giants! β’ 8 items β’ Updated Dec 3, 2024 β’ 20
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper β’ 2502.02737 β’ Published Feb 4 β’ 214