10 11 23

Shengyi Costa Huang

vwxyzjn

http://costa.sh

AI & ML interests

None yet

Recent Activity

updated a model 1 minute ago

allenai/OLMoE-1B-7B-0125-Instruct

updated a model 2 minutes ago

allenai/OLMo-2-1124-13B-Instruct-RLVR2

updated a model 3 minutes ago

allenai/OLMo-2-1124-13B-Instruct

View all activity

Organizations

Articles 5

Article

118

How NuminaMath Won the 1st AIMO Progress Prize

Article

Preference Optimization for Vision Language Models

View all Articles

Collections 4

models 393

vwxyzjn/ppo_async

Updated Feb 5 • 31

vwxyzjn/ppo_sync

Updated Feb 5 • 50

vwxyzjn/online_dpo_sync

Updated Feb 5 • 35

vwxyzjn/online_dpo_async

Updated Feb 5 • 35

vwxyzjn/rm_zephyr_new

Text Classification • Updated Sep 26, 2024 • 16

vwxyzjn/online_dpo_vllm_thread_beta_0.03__allenai_open_instruct_dev

Updated Sep 11, 2024

vwxyzjn/reward_modeling__EleutherAI_pythia-14m

Updated Aug 24, 2024 • 15

vwxyzjn/online_dpo_vllm__vwxyzjn_btulu

Updated Aug 23, 2024 • 11

vwxyzjn/online_dpo_vllm__allenai_llama-3-tulu-2-8b

Updated Aug 19, 2024 • 14

vwxyzjn/btulu

Text Generation • Updated Aug 19, 2024 • 9

datasets 287

vwxyzjn/multiplication_train_1000_2x2-gsm8k-verifier

Viewer • Updated 3 days ago • 1k • 27

vwxyzjn/rlvr-mult-gsm8k-verifier

Viewer • Updated 4 days ago • 5.1k • 41

vwxyzjn/rlvr_acecoder

Viewer • Updated 27 days ago • 87.1k • 96

vwxyzjn/old-tulu-3-mix-pref-dataset

Viewer • Updated Jan 21 • 149k • 121

vwxyzjn/old-tulu-3-mix-dataset

Viewer • Updated Jan 21 • 934k • 115

vwxyzjn/norobot_pref_4860

Viewer • Updated Oct 2, 2024 • 59.9k • 178

vwxyzjn/norobot_generation_4860

Viewer • Updated Oct 2, 2024 • 29.9k • 160

vwxyzjn/norobot_pref_465

Viewer • Updated Oct 2, 2024 • 59.4k • 154

vwxyzjn/norobot_generation_465

Viewer • Updated Oct 2, 2024 • 29.7k • 217

vwxyzjn/norobot_generation_16325

Viewer • Updated Oct 2, 2024 • 29.7k • 353

Shengyi Costa Huang

AI & ML interests

Recent Activity

Organizations

Articles 5

How NuminaMath Won the 1st AIMO Progress Prize

Preference Optimization for Vision Language Models

Collections 4

Papers 10

spaces 4 Sort: Recently updated

Test

Aim

Vwxyzjn Testyes4

Pyserini Wikipedia Kilt Doc

models 393 Sort: Recently updated

datasets 287 Sort: Recently updated

spaces 4

models 393

datasets 287