29 418 19

Fangyuan Yu PRO

Ksgk-fy

fangyuan-ksgk

AI & ML interests

AGI

Recent Activity

updated a collection 4 days ago

Cognition

updated a collection 7 days ago

Cognition

liked a model 8 days ago

SmallDoge/Doge-20M

View all activity

Organizations

Ksgk-fy's activity

upvoted a collection 12 days ago

Image / Video Gen

Collection

Image Generation Using Diffusion-Based Methods: Tips and Techniques for Stable Diffusion • 36 items • Updated 13 days ago • 9

upvoted a paper 15 days ago

Scaling LLM Pre-training with Vocabulary Curriculum

Paper • 2502.17910 • Published 17 days ago • 1

upvoted 2 papers 16 days ago

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published 16 days ago • 68

Slamming: Training a Speech Language Model on One GPU in a Day

Paper • 2502.15814 • Published 22 days ago • 66

upvoted 4 papers 18 days ago

You Do Not Fully Utilize Transformer's Representation Capacity

Paper • 2502.09245 • Published 28 days ago • 34

Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity

Paper • 2502.13063 • Published 23 days ago • 66

ActionPiece: Contextually Tokenizing Action Sequences for Generative Recommendation

Paper • 2502.13581 • Published 22 days ago • 5

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published 21 days ago • 60

upvoted a paper 22 days ago

Autellix: An Efficient Serving Engine for LLM Agents as General Programs

Paper • 2502.13965 • Published 22 days ago • 18

upvoted a paper 28 days ago

Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling

Paper • 2501.16975 • Published Jan 28 • 26

upvoted 2 papers 29 days ago

CoS: Chain-of-Shot Prompting for Long Video Understanding

Paper • 2502.06428 • Published Feb 10 • 10

CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

Paper • 2502.07316 • Published about 1 month ago • 47

upvoted a paper about 1 month ago

VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models

Paper • 2502.02492 • Published Feb 4 • 61

upvoted an article about 1 month ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.16k

upvoted a paper about 2 months ago

Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments

Paper • 2501.10893 • Published Jan 18 • 24

upvoted 4 papers 2 months ago

Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published Jan 8 • 86

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 263

LTX-Video: Realtime Video Latent Diffusion

Paper • 2501.00103 • Published Dec 30, 2024 • 42

Training Software Engineering Agents and Verifiers with SWE-Gym

Paper • 2412.21139 • Published Dec 30, 2024 • 22

upvoted a paper 3 months ago

Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation

Paper • 2412.04432 • Published Dec 5, 2024 • 16