HF中国镜像站

Michael Feldman's picture

25 228

Michael Feldman

mfeldman143

·

AI & ML interests

None yet

Recent Activity

liked a model about 12 hours ago

sesame/csm-1b

liked a Space 1 day ago

depth-anything/PromptDA

liked a Space 1 day ago

depth-anything/Video-Depth-Anything

View all activity

Organizations

mfeldman143's activity

upvoted a paper 1 day ago

Token-Efficient Long Video Understanding for Multimodal LLMs

Paper • 2503.04130 • Published 8 days ago • 79

upvoted a paper 4 days ago

Forgetting Transformer: Softmax Attention with a Forget Gate

Paper • 2503.02130 • Published 10 days ago • 27

upvoted a collection 17 days ago

pi0_models

1 item • Updated Feb 4 • 9

upvoted a collection 19 days ago

Ovis2

Our latest advancement in multi-modal large language models (MLLMs) • 8 items • Updated about 22 hours ago • 55

upvoted a collection 20 days ago

SigLIP2

36 items • Updated 2 days ago • 62

upvoted a paper 21 days ago

Craw4LLM: Efficient Web Crawling for LLM Pretraining

Paper • 2502.13347 • Published 23 days ago • 27

upvoted a paper about 1 month ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 203

upvoted an article about 1 month ago

Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

Feb 4

• 113

upvoted 2 collections about 1 month ago

Models, Jan 27

12 items • Updated Jan 27 • 1

Sapiens

Foundation models for human tasks. Code: https://github.com/facebookresearch/sapiens • 72 items • Updated Sep 18, 2024 • 57

upvoted a collection about 2 months ago

Qwen2.5-1M

The long-context version of Qwen2.5, supporting 1M-token context lengths • 3 items • Updated 16 days ago • 107

upvoted a paper about 2 months ago

O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning

Paper • 2501.06458 • Published Jan 11 • 29

upvoted 3 papers 2 months ago

PokéLLMon: A Human-Parity Agent for Pokémon Battles with Large Language Models

Paper • 2402.01118 • Published Feb 2, 2024 • 31

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 263

Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers

Paper • 2408.06195 • Published Aug 12, 2024 • 70

upvoted 2 collections 2 months ago

NeMo Audio Codecs

A series of Neural Audio Codecs • 5 items • Updated Jan 17 • 11

Cosmos

The collection of Cosmos models • 31 items • Updated Jan 17 • 269

upvoted a paper 3 months ago

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 111

upvoted a collection 3 months ago

Llama 3.3 (All Versions)

Meta's new Llama 3.3 (70B) model in all formats. Includes GGUF, 4-bit bnb and original versions. • 3 items • Updated 2 days ago • 36

upvoted a paper 4 months ago

FastDraft: How to Train Your Draft

Paper • 2411.11055 • Published Nov 17, 2024 • 10