HF中国镜像站

Mishig Davaadorj's picture

Mishig Davaadorj

mishig

·

AI & ML interests

NP-completeness, grammars, universality

Recent Activity

updated a Space 2 days ago

huggingface/inference-playground

updated a Space 7 days ago

lerobot/visualize_dataset

upvoted an article 13 days ago

Train 400x faster Static Embedding Models with Sentence Transformers

View all activity

Organizations

mishig's activity

updated a Space 2 days ago

Inference Playground

Engage in chat conversations

updated a Space 7 days ago

Visualize Dataset (v2.0+ latest dataset format)

Browse robotic datasets visually

upvoted 2 articles 13 days ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

Jan 15

• 159

Article

❤️ a love letter to the Open AI inference client

By

•

13 days ago

• 9

updated a Space 14 days ago

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

New activity in nanotron/ultrascale-playbook 14 days ago

Make hash section working

#89 opened 14 days ago by

upvoted an article 17 days ago

Article

Remote VAEs for decoding with HF endpoints 🤗

18 days ago

• 35

upvoted a paper 21 days ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published 22 days ago • 161

upvoted a paper 22 days ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published 25 days ago • 142

liked a Space 23 days ago

AI Podcast Generator

Generate Podcast using Kokoro-TTS!

liked a model 27 days ago

zed-industries/zeta

Updated 14 days ago • 2.4k • 228

upvoted a collection 29 days ago

SYNTHETIC-1

A collection of tasks & verifiers for reasoning datasets • 9 items • Updated 21 days ago • 49

upvoted an article 29 days ago

Article

State of open video generation models in Diffusers

Jan 27

• 50

upvoted a paper about 1 month ago

DynVFX: Augmenting Real Videos with Dynamic Content

Paper • 2502.03621 • Published Feb 5 • 29

upvoted a collection about 1 month ago

Hibiki fr-en

Hibiki is a model for streaming speech translation , which can run on device! See https://github.com/kyutai-labs/hibiki. • 5 items • Updated Feb 6 • 50

upvoted a paper about 1 month ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 203

updated a model about 1 month ago

simplescaling/s1-32B

Text Generation • Updated 15 days ago • 14.5k • 288

New activity in simplescaling/s1-32B about 1 month ago

Update README.md

#1 opened about 1 month ago by

upvoted a paper about 1 month ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 111

upvoted an article about 1 month ago

Article

Replicating DeepSeek R1 for Information Extraction

By

•

Jan 31

• 38