Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
merve 
posted an update Jan 31
Post
3876
This week in open AI was 🔥 Let's recap! 🤗 merve/january-31-releases-679a10669bd4030090c5de4d
LLMs 💬
> Huge: AllenAI released new Tülu models that outperform DeepSeek R1 using Reinforcement Learning with Verifiable Reward (RLVR) based on Llama 3.1 405B 🔥
> Mistral AI is back to open-source with their "small" 24B models (base & SFT), with Apache 2.0 license 😱
> Alibaba Qwen released their 1M context length models Qwen2.5-Instruct-1M, great for agentic use with Apache 2.0 license 🔥
> Arcee AI released Virtuoso-medium, 32.8B LLMs distilled from DeepSeek V3 with dataset of 5B+ tokens
> Velvet-14B is a new family of 14B Italian LLMs trained on 10T tokens in six languages
> OpenThinker-7B is fine-tuned version of Qwen2.5-7B-Instruct on OpenThoughts dataset

VLMs & vision 👀
> Alibaba Qwen is back with Qwen2.5VL, amazing new capabilities ranging from agentic computer use to zero-shot localization 🔥
> NVIDIA released new series of Eagle2 models with 1B and 9B sizes
> DeepSeek released Janus-Pro, new any-to-any model (image-text generation from image-text input) with MIT license
> BEN2 is a new background removal model with MIT license!

Audio 🗣️
> YuE is a new open-source music generation foundation model, lyrics-to-song generation

Codebase 👩🏻‍💻
> We are open-sourcing our SmolVLM training and eval codebase! https://github.com/huggingface/smollm/tree/main/vision
> Open-R1 is open-source reproduction of R1 by @huggingface science team https://huggingface.co/blog/open-r1

This week in "Open"-AI was 🔥 Let's recap! 🤗
merve/january-31-releases-679a10669bd4030090c5de4d
LLMs 💬
Huge: AllenAI released new Tülu models that outperform DeepSeek R1 using Reinforcement Learning with Verifiable Reward (RLVR) based on Llama 3.1 405B

Tülu models are based on proprietary Llama license, thus do not fit into the same category as DeepSeek which is truly free software, and which suddenly helped so many other companies.

You can't be comparing apples with oranges. Yes, you can, but it is obvious huge difference and person comparing it doesn't get right kudos he/she wanted to get.

AllenAI pretending to be "Open" sadly, joining companies like META to deceit and betray and enter into the community, it is type of deceitful propaganda where words such as "Open" are not protected, but rather tend to attract good portion of oblivious community.

Proprietary software category cannot be compared to free software category.

DeepSeek has reached the popularity for reason it is free software.

Infecting the AI space with proprietary software like Tülu by AllenAI to me looks like US propaganda against China.

I liked AllenAI truthfully, but now I see how much tricky they are, I feel deeply hurt by that betrayal.

References:

Meta’s LLaMa 2 license is not Open Source – Open Source Initiative:
https://opensource.org/blog/metas-llama-2-license-is-not-open-source

The Open Source Definition – Open Source Initiative:
https://opensource.org/osd

What is Free Software? - GNU Project - Free Software Foundation:
https://www.gnu.org/philosophy/free-sw.html

Word "Open" as in "Open Source" - Words to Avoid (or Use with Care) Because They Are Loaded or Confusing:
https://www.gnu.org/philosophy/words-to-avoid.html#Open

In this post