1 1 14

wang

zhaokai

gklab

AI & ML interests

None yet

Recent Activity

liked a dataset 29 days ago

Congliu/Chinese-DeepSeek-R1-Distill-data-110k

liked a Space about 1 month ago

nanotron/ultrascale-playbook

liked a model 2 months ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

View all activity

Organizations

zhaokai's activity

liked a dataset 29 days ago

Congliu/Chinese-DeepSeek-R1-Distill-data-110k

Viewer • Updated Feb 21 • 110k • 7.77k • 588

liked a Space about 1 month ago

2.34k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked 2 models 2 months ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

Text Generation • Updated 30 days ago • 1.86M • • 1.29k

deepseek-ai/DeepSeek-R1

Text Generation • Updated 30 days ago • 1.5M • • 11.6k

liked a model 3 months ago

deepseek-ai/DeepSeek-V3-Base

Updated 30 days ago • 413k • 1.61k

upvoted a collection 6 months ago

Qwen2-VL

Collection

Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024 • 209

liked 2 models 7 months ago

microsoft/Phi-3.5-MoE-instruct

Text Generation • Updated 18 days ago • 42.4k • • 556

Qwen/Qwen2-Audio-7B-Instruct

Audio-Text-to-Text • Updated Jan 12 • 103k • • 391

liked a model 8 months ago

meta-llama/Prompt-Guard-86M

Text Classification • Updated Jul 25, 2024 • 34.2k • • 244

liked a model 10 months ago

openbmb/MiniCPM-Llama3-V-2_5

Image-Text-to-Text • Updated Jan 15 • 25.4k • 1.39k

liked a Space 10 months ago

893

FineWeb: decanting the web for the finest text data at scale

🍷

Generate high-quality web text data for LLM training

liked a dataset about 1 year ago

Skywork/SkyPile-150B

Viewer • Updated Dec 7, 2023 • 1.76M • 3.5k • 365

New activity in SkunkworksAI/phi-2 over 1 year ago

Update config.json

#7 opened over 1 year ago by

zhaokai

liked 3 models over 1 year ago