view post Post 1560 Mini-QwQ an edge device friendly reasoning model distilled from QwQ-32B 🤗: kz919/QwQ-0.5B-Distilled-SFT🇬 🇬 🇺 🇫: kz919/QwQ-0.5B-Distilled-SFT-gguf🤖: kz919/Mini-QwQ See translation 👍 7 7 + Reply
Cautious Optimizers: Improving Training with One Line of Code Paper • 2411.16085 • Published Nov 25, 2024 • 19
view post Post 1518 Just for the meme.But the clear lesson I learnt from building these demos are, the more powerful the underlying base model is, the closer you will get to GPT4o1. CoT is nothing more than simply inducing the latent reasoning capability from the model. kz919/GPT4-O1-Proximas 🚀 6 6 🔥 2 2 😎 1 1 + Reply
view post Post 2456 "It's Sunday night, fancy a game?"https://kz919-can-you-beat-405b-in-chess.hf.space/built with the one and only SN fast API:https://sambanova.ai/fast-api?api_ref=907266 7 replies · 🧠 8 8 🔥 2 2 + Reply
view post Post 644 Good lord... Spent almost a day debugging this and it turns out it was an issue of gradio update incompatible with the new fastapi.https://discuss.huggingface.co/t/huggingface-space-failed-after-working-initially/105514/8Finally got it back online! Come chat with your favorite anime characters here: kz919/Persona-AI 👀 3 3 + Reply
view post Post 1595 Spent a few minutes to build an alternative to Character AI on top of llama3.1 405B through SambaNova's super fast inference API Space: kz919/Persona-AIAPI referral link: https://sambanova.ai/fast-api?api_ref=907266 3 replies · 🔥 3 3 😎 3 3 🚀 2 2 🤗 2 2 🤯 2 2 🧠 2 2 + Reply
view post Post 1695 The only 405B spaces still freely accessible are powered by SN fast api. xianbao/SambaNova-fasthttps://sambanova.ai/fast-api?api_ref=907266 👀 6 6 🔥 4 4 🤗 2 2 😎 1 1 + Reply
DataComp-LM: In search of the next generation of training sets for language models Paper • 2406.11794 • Published Jun 17, 2024 • 50
SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts Paper • 2405.07518 • Published May 13, 2024 • 27
SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts Paper • 2405.07518 • Published May 13, 2024 • 27
SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts Paper • 2405.07518 • Published May 13, 2024 • 27
Communication Efficient Distributed Training with Distributed Lion Paper • 2404.00438 • Published Mar 30, 2024 • 2
Lion Secretly Solves Constrained Optimization: As Lyapunov Predicts Paper • 2310.05898 • Published Oct 9, 2023 • 2