Model assets for the first Mixture-of-LoRA technique applied to Llama. https://bit.ly/48bqshl
maxine
crumb
AI & ML interests
small models
Recent Activity
Updated a dataset about 14 hours ago: crumb/minus1126054896653180804_1to8192_64K
Published a dataset about 14 hours ago: crumb/minus1126054896653180804_1to8192_64K
Liked a dataset about 1 month ago: HuggingFaceH4/MATH
Collections 5
models 149
crumb/d87b8c-6h-128d • Text Generation • 58
crumb/d3747f-12h-64d • Text Generation • 60
crumb/ce7cf9-8h-96d • Text Generation • 62
crumb/d38a14-32h-16d • Text Generation • 65
crumb/3291e9-16h-32d • Text Generation • 63
crumb/f358a4-8h-64d • Text Generation • 65
crumb/efa25a-4h-128d • Text Generation • 75
crumb/7457fb-2h-256d • Text Generation • 69
crumb/GLORT2 • Text Generation • 56
crumb/160m-plus-sauce • Text Generation • 161
datasets 52
crumb/minus1126054896653180804_1to8192_64K • 65.5k • 2
crumb/onebench • 7.52k • 79
crumb/askmistral-pile-2-15 • 2.34M • 343 • 10
crumb/songbird • 12M • 13
crumb/semantic-corruption-t5-v1_1-small • 2.28k • 66
crumb/semantic-corruption-t5-v1_1-base • 1.54k • 69
crumb/semantic-corruption-t5-v1_1-large • 753 • 83
crumb/reup-test • 2k • 9
crumb/redbeam-c4-rated-vit-l14-laion • 5
crumb/dummy-cot-sampling-dataset-clean-preview • 10