Adina Yakefu

AdinaY

AI & ML interests

None yet

Recent Activity

updated a Space 1 minute ago

zh-ai-community/china-ai-policy-research

reacted to clem's post with 🚀 about 3 hours ago

We just crossed 1,500,000 public models on Hugging Face (and 500k spaces, 330k datasets, 50k papers). One new repository is created every 15 seconds. Congratulations all!

AdinaY's activity

updated a Space 1 minute ago

China AI policy research 🤗

🔥

Explore China's AI policy updates

reacted to clem's post with 🚀🤗 about 3 hours ago

Post

829

We just crossed 1,500,000 public models on Hugging Face (and 500k spaces, 330k datasets, 50k papers). One new repository is created every 15 seconds. Congratulations all!

1 reply

upvoted a collection about 16 hours ago

🎬 Video model 2025

Collection

6 items • Updated about 16 hours ago • 3

posted an update about 19 hours ago

Post

626

SEA-VL 🔥 an OPEN dataset to bridge the AI culture gap in Southeast Asia!

Paper: Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia (2503.07920)

✨1.28M+ culturally relevant images
✨85% accuracy in auto-collected images
✨Tracking underrepresented SEA languages & traditions

upvoted a paper 1 day ago

Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model

Paper • 2503.07703 • Published 4 days ago • 29

posted an update 1 day ago

Post

876

Open Sora 2.0 is out 🔥
hpcai-tech/open-sora-20-67cfb7efa80a73999ccfc2d5
✨ 11B with Apache 2.0
✨ Low training cost - $200k
✨ open weights, code and training workflow

updated 2 collections 1 day ago

📊 Dataset

Collection

7 items • Updated 1 day ago • 1

2025 March

Collection

10 items • Updated 1 day ago

liked a model 1 day ago

hpcai-tech/Open-Sora-v2

Updated 2 days ago • 32

updated 2 collections 1 day ago

2025 March

Collection

10 items • Updated 1 day ago

🎬 Video model 2025

Collection

6 items • Updated about 16 hours ago • 3

upvoted a collection 1 day ago

Open-Sora 2.0

Collection

3 items • Updated 2 days ago • 6

upvoted 3 papers 2 days ago

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

Paper • 2503.07572 • Published 4 days ago • 31

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Paper • 2503.08638 • Published 3 days ago • 54

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

Paper • 2503.07920 • Published 3 days ago • 89

upvoted 2 articles 2 days ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

2 days ago

• 232

Article

Open R1: Update #3

and 9 others •

3 days ago

• 207

reacted to clefourrier's post with 🚀 2 days ago

Post

1498

Gemma3 family is out! Reading the tech report, and this section was really interesting to me from a methods/scientific fairness pov.

Instead of doing over-hyped comparisons, they clearly state that **results are reported in a setup which is advantageous to their models**.
(Which everybody does, but people usually don't say)

For a tech report, it makes a lot of sense to report model performance when used optimally!
On leaderboards, on the other hand, comparisons will be apples to apples, but in a potentially suboptimal way for a given model family (just as some users interact suboptimally with models).

It also contains a cool section (6) on training data memorization rate! It's important to see whether your model will output the training data it has seen verbatim: always an issue for privacy/copyright/... but also very much for evaluation!

Because if your model knows its evals by heart, you're not testing for generalization.

reacted to thomwolf's post with 🚀 2 days ago

Post

1822

We've kept pushing our Open-R1 project, an open initiative to replicate and extend the techniques behind DeepSeek-R1.

And even we were mind-blown by the results we got with this latest model we're releasing: ⚡️OlympicCoder (open-r1/OlympicCoder-7B and open-r1/OlympicCoder-32B)

It's beating Claude 3.7 on (competitive) programming, a domain Anthropic has historically been really strong at, and it's getting close to o1-mini/R1 on olympiad-level coding with just 7B parameters!

And the best part is that we're open-sourcing everything about its training dataset, the new IOI benchmark, and more in our Open-R1 progress report #3: https://huggingface.co/blog/open-r1/update-3

Datasets we are releasing:
- open-r1/codeforces
- open-r1/codeforces-cots
- open-r1/ioi
- open-r1/ioi-test-cases
- open-r1/ioi-sample-solutions
- open-r1/ioi-cots
- open-r1/ioi-2024-model-solutions