1 5 6

Hugo Larcher

hlarcher

AI & ML interests

None yet

Recent Activity

reacted to fdaudens's post with 🔥 1 day ago

🔥The Open R1 team just dropped OlympicCoder and it's wild: - 7B model outperforms Claude 3.7 Sonnet on IOI benchmark (yes, 7B!!) - 32B crushes all open-weight models tested, even those 100x larger 🤯 Open-sourcing the future of code reasoning! 🚀 Check it out https://huggingface.co/blog/open-r1/update-3

updated a model 21 days ago

hlarcher/opt-125m-lfs

published a model 21 days ago

hlarcher/opt-125m-lfs

View all activity

Organizations

Posts 1

Post

1080

We are introducing multi-backend support in HF中国镜像站 Text Generation Inference!
With new TGI architecture we are now able to plug new modeling backends to get best performances according to selected model and available hardware. This first step will very soon be followed by the integration of new backends (TRT-LLM, llama.cpp, vLLM, Neuron and TPU).

We are polishing the TensorRT-LLM backend which achieves impressive performances on NVIDIA GPUs, stay tuned 🤗 !

Check out the details: https://huggingface.co/blog/tgi-multi-backend