HF China Mirror Site

Sugato Ray PRO

sugatoray · https://linkedin.com/in/sugatoray

AI & ML interests

None yet

Recent Activity

updated a collection about 23 hours ago

Papers-Fundamentals

updated a collection about 23 hours ago

upvoted a paper about 23 hours ago

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Organizations

sugatoray's activity

upvoted a paper about 23 hours ago

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Paper • 2503.09573 • Published 1 day ago • 41

upvoted 2 papers 1 day ago

HoT: Highlighted Chain of Thought for Referencing Supporting Facts from Inputs

Paper • 2503.02003 • Published 11 days ago • 42

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

Paper • 2503.07572 • Published 4 days ago • 31

upvoted a collection 1 day ago

👩‍💻 OlympicCoder

Reasoning datasets and models for competitive coding • 4 items • Updated 3 days ago • 8

upvoted an article 1 day ago

Open R1: Update #3

Article • By … and 9 others • 3 days ago • 207

upvoted an article 2 days ago

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Article • 2 days ago • 232

upvoted 2 collections 2 days ago

Gemma 3 Release

9 items • Updated about 14 hours ago • 229

Gemma 3

All versions of Google's new multimodal models in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats. • 29 items • Updated about 2 hours ago • 30

upvoted a paper 2 days ago

LettuceDetect: A Hallucination Detection Framework for RAG Applications

Paper • 2502.17125 • Published 18 days ago • 8

upvoted an article 2 days ago

LettuceDetect: A Hallucination Detection Framework for RAG Applications

Article • 14 days ago • 7

upvoted 3 papers 2 days ago

A Survey on Post-training of Large Language Models

Paper • 2503.06072 • Published 6 days ago • 1

R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcing Learning

Paper • 2503.05379 • Published 7 days ago • 30

MMTEB: Massive Multilingual Text Embedding Benchmark

Paper • 2502.13595 • Published 23 days ago • 32

upvoted a paper 4 days ago

L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning

Paper • 2503.04697 • Published 8 days ago • 2

upvoted an article 4 days ago

Introducing EuroBERT: A High-Performance Multilingual Encoder Model

Article • By … and 3 others • 4 days ago • 117

upvoted a paper 4 days ago

R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

Paper • 2503.05592 • Published 7 days ago • 25

upvoted a collection 4 days ago

Benchmarks

81 items • Updated 3 days ago • 2

upvoted a paper 4 days ago

SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?

Paper • 2502.12115 • Published 25 days ago • 43

upvoted 2 papers 6 days ago

MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents

Paper • 2503.01935 • Published 11 days ago • 24

Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs

Paper • 2503.01307 • Published 11 days ago • 31