HF China Mirror Site

hysts

AI & ML interests

Computer Vision

Recent Activity

upvoted an article about 17 hours ago

Open R1: Update #3

new activity about 19 hours ago

MaverickAlex/R-FLAV:Apply for community grant: Academic project (gpu)

updated a Space 1 day ago

huggingface-projects/gemma-3-12b-it

hysts's activity

upvoted an article about 17 hours ago

Open R1: Update #3

Article • By … and 9 others • 2 days ago • 186

upvoted an article 1 day ago

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Article • 2 days ago • 196

upvoted an article 8 days ago

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

Article • 10 days ago • 65

upvoted an article 16 days ago

FastRTC: The Real-Time Communication Library for Python

Article • 17 days ago • 141

upvoted an article 28 days ago

From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages

Article • By … and 1 other • about 1 month ago • 26

upvoted 2 articles 29 days ago

Open R1: Update #2

Article • By … and 6 others • Feb 10 • 202

Open-R1: Update #1

Article • By … and 7 others • Feb 2 • 295

upvoted 4 articles about 1 month ago

Open-source DeepResearch – Freeing our search agents

Article • Feb 4 • 1.16k

The AI tools for Art Newsletter - Issue 1

Article • Jan 31 • 70

Open-R1: a fully open reproduction of DeepSeek-R1

Article • Jan 28 • 803

Welcome to Inference Providers on the Hub 🔥

Article • Jan 28 • 429

upvoted a collection 6 months ago

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18, 2024 • 227

upvoted a paper 10 months ago

An Introduction to Vision-Language Modeling

Paper • 2405.17247 • Published May 27, 2024 • 88

upvoted a paper about 1 year ago

LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens

Paper • 2402.13753 • Published Feb 21, 2024 • 115

upvoted 6 papers over 1 year ago

ChatAnything: Facetime Chat with LLM-Enhanced Personas

Paper • 2311.06772 • Published Nov 12, 2023 • 35

Music ControlNet: Multiple Time-varying Controls for Music Generation

Paper • 2311.07069 • Published Nov 13, 2023 • 44

Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models

Paper • 2311.06783 • Published Nov 12, 2023 • 28

I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models

Paper • 2311.04145 • Published Nov 7, 2023 • 35

Learning From Mistakes Makes LLM Better Reasoner

Paper • 2310.20689 • Published Oct 31, 2023 • 29

CapsFusion: Rethinking Image-Text Data at Scale

Paper • 2310.20550 • Published Oct 31, 2023 • 26