Haofan Wang

wanghaofan

AI & ML interests

Co-Founder&Researcher@InstantX

Organizations

Stable Diffusion Dreambooth Concepts Library, ZeroGPU Explorers, InstantX, Social Post Explorers, Shakker Labs

wanghaofan's activity

reacted to AdinaY's post with 🔥 17 days ago
Two AI startups, DeepSeek & Moonshot AI, keep moving in perfect sync 👇

✨ Last December: DeepSeek & Moonshot AI released their reasoning models on the SAME DAY.
DeepSeek: deepseek-ai/DeepSeek-R1
Moonshot: https://github.com/MoonshotAI/Kimi-k1.5

✨ Last week: Both teams published papers on modifying attention mechanisms on the SAME DAY AGAIN.
DeepSeek: Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention (2502.11089)
Moonshot: MoBA: Mixture of Block Attention for Long-Context LLMs (2502.13189)

✨ TODAY:
DeepSeek unveiled FlashMLA: an efficient MLA decoding kernel for NVIDIA Hopper GPUs, optimized for variable-length sequences.
https://github.com/deepseek-ai/FlashMLA

Moonshot AI introduces Moonlight: a 3B/16B MoE trained on 5.7T tokens using Muon, pushing the Pareto frontier with fewer FLOPs.
moonshotai/Moonlight-16B-A3B

What's next? 👀
New activity in CSU-JPG/TextAtlas5M 22 days ago

Update README.md

#3 opened 22 days ago by wanghaofan
updated a Space 26 days ago