2 26 104

wangrui

varuy322

varuy322

AI & ML interests

None yet

Recent Activity

upvoted a collection about 2 hours ago

top-edu-hf-25m3

updated a collection about 2 hours ago

top-edu-hf-25m3

updated a collection about 2 hours ago

top-edu-hf-25m3

View all activity

Organizations

None yet

varuy322's activity

upvoted a collection about 2 hours ago

top-edu-hf-25m3

Collection

12 items • Updated about 2 hours ago • 3

updated a collection about 2 hours ago

top-edu-hf-25m3

Collection

12 items • Updated about 2 hours ago • 3

liked a model about 3 hours ago

allenai/OLMo-2-0325-32B-Instruct

Text Generation • Updated about 15 hours ago • 499 • 47

upvoted 2 papers about 12 hours ago

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

Paper • 2503.07572 • Published 4 days ago • 31

SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities

Paper • 2502.12025 • Published 25 days ago • 1

upvoted a paper 1 day ago

Skywork-Reward: Bag of Tricks for Reward Modeling in LLMs

Paper • 2410.18451 • Published Oct 24, 2024 • 18

upvoted a collection 1 day ago

Skywork-Reward-Data-Collection

Collection

Open-source preference datasets used to train the Skywork reward model series • 17 items • Updated Oct 12, 2024 • 15

liked a Space 1 day ago

345

Reward Bench Leaderboard

📐

Explore and analyze RewardBench leaderboard data

liked a model 1 day ago

HuggingFaceTB/finemath-classifier

Text Classification • Updated Dec 19, 2024 • 27.3k • 9

liked a dataset 1 day ago

OpenCoder-LLM/opc-fineweb-code-corpus

Viewer • Updated Nov 24, 2024 • 101M • 2.02k • 40