Andres Marafioti

andito

AI & ML interests

Multimodal models, VLM and TTS

Recent Activity

liked a model 1 day ago
deepseek-ai/deepseek-vl2-tiny
liked a Space 2 days ago
Kwai-Kolors/Kolors-Virtual-Try-On
liked a Space 2 days ago
jallenjia/Change-Clothes-AI

Organizations

Hugging Face, HuggingFaceM4, Huggingface Projects, Hugging Face H4, Hugging Face OSS Metrics, Hugging Face TB Research, MLX Community, Distillation Hugs, Argilla Warehouse, Hugging Face FineVideo, smol-explorers, Hugging Face Science, Open R1

andito's activity

upvoted an article 7 days ago

Replicating DeepSeek R1 for Information Extraction

By Ihor
38
posted an update 8 days ago
Extremely bullish on @CohereForAI's Aya Vision (8B & 32B) - new SOTA open-weight VLMs

- 8B wins up to 81% of the time in its class, better than Gemini Flash
- 32B beats Llama 3.2 90B!
- Covers 23 languages, excels in image captioning, VQA & more
- Integrated on transformers from Day 0!

Efficient multimodal models are here to stay!!🔥
Check out their blog! https://huggingface.co/blog/aya-vision
upvoted an article 9 days ago

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

65
upvoted an article 21 days ago

SmolVLM2: Bringing Video Understanding to Every Device

205
published an article 22 days ago

SmolVLM2: Bringing Video Understanding to Every Device

205
upvoted an article about 1 month ago

Open-source DeepResearch – Freeing our search agents

1.16k
New activity in HuggingFaceTB/SmolVLM-256M-Instruct about 1 month ago

Add ONNX sample code

#8 opened about 1 month ago by Xenova