I was chatting with @peakji, one of the cofounders of Manus AI, who told me he was on HF (very cool!).
He shared an interesting insight: agentic capabilities might be more of an alignment problem than a foundational capability issue. Similar to the difference between GPT-3 and InstructGPT, some open-source foundation models are simply trained to 'answer everything in one response regardless of the complexity of the question' - after all, that's the user preference in chatbot use cases. Just a bit of post-training on agentic trajectories can make an immediate and dramatic difference.
As a thank you to the community, he shared 100 invite codes, first-come first-served: just use "HUGGINGFACE" to get access!