HF China Mirror Site

Andres Marafioti

andito

AI & ML interests

Multimodal models, VLM and TTS

Recent Activity

liked a model 1 day ago

deepseek-ai/deepseek-vl2-tiny

liked a Space 2 days ago

Kwai-Kolors/Kolors-Virtual-Try-On

liked a Space 2 days ago

jallenjia/Change-Clothes-AI

andito's activity

liked a model 1 day ago

deepseek-ai/deepseek-vl2-tiny

Image-Text-to-Text • Updated Dec 18, 2024 • 81k • 165

liked 2 Spaces 2 days ago

Kolors Virtual Try-On

Upload images to try on clothes virtually

Change Clothes AI

AI Clothes Changer Online

upvoted an article 7 days ago

Article

Replicating DeepSeek R1 for Information Extraction

Jan 31 • 38

liked a dataset 8 days ago

fixie-ai/gigaspeech

Viewer • Updated Sep 7, 2024 • 16.6M • 13k • 4

posted an update 8 days ago

Post

2425

Extremely bullish on @CohereForAI 's Aya Vision (8B & 32B) - new SOTA open-weight VLMs

- 8B wins up to 81% of the time in its class, better than Gemini Flash
- 32B beats Llama 3.2 90B!
- Covers 23 languages, excels in image captioning, VQA & more
- Integrated on transformers from Day 0!

Efficient multimodal models are here to stay!!🔥
Check out their blog! https://huggingface.co/blog/aya-vision

liked 2 models 8 days ago

CohereForAI/aya-vision-8b

Image-Text-to-Text • Updated 9 days ago • 147k • 250

HuggingFaceTB/SmolVLM2-256M-Video-Instruct

Image-Text-to-Text • Updated 7 days ago • 5.08k • 40

liked a Space 9 days ago

Di♪♪Rhythm

Blazingly Fast and Embarrassingly Simple Song Generation

liked a model 9 days ago

ASLP-lab/DiffRhythm-base

Updated about 2 hours ago • 133

upvoted an article 9 days ago

Article

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

10 days ago • 65

liked a model 21 days ago

HuggingFaceTB/SmolVLM2-2.2B-Instruct

Image-Text-to-Text • Updated 7 days ago • 503k • 110

liked a Space 21 days ago

SmolVLM

Generate text by analyzing images and videos

upvoted an article 21 days ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

22 days ago • 205

published an article 22 days ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

22 days ago • 205

liked a Space 22 days ago

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

authored a paper about 1 month ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 203

upvoted a paper about 1 month ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 203

upvoted an article about 1 month ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4 • 1.16k

New activity in HuggingFaceTB/SmolVLM-256M-Instruct about 1 month ago

Add ONNX sample code

#8 opened about 1 month ago by