Yixin Song

yixinsong

AI & ML interests

None yet

Recent Activity

liked a dataset 7 days ago

epfml/FineWeb2-HQ

liked a dataset 14 days ago

bytedance-research/MAGACorpus

published a model 18 days ago

PowerInfer/OPT-7B-predictor

yixinsong's activity

New activity in PowerInfer/SmallThinker-3B-Preview about 2 months ago

Eval script

#9 opened about 2 months ago by

rawsh

New activity in PowerInfer/SmallThinker-3B-Preview 2 months ago

About the training details

#5 opened 2 months ago by

hiyouga

How to Pair with Larger Models

#7 opened 2 months ago by

windkkk

Prompt/token adjust to stop "Overthinking" in unnecessary cases

#6 opened 2 months ago by

fuzzy-mittenz

example use colab?

#3 opened 2 months ago by

NickyNicky

Update README.md

#4 opened 2 months ago by

AISafety

Training: Second Phase

#2 opened 2 months ago by

tugstugi

New activity in PowerInfer/QWQ-LONGCOT-500K 2 months ago

[bot] Conversion to Parquet

#1 opened 3 months ago by

parquet-converter

New activity in PowerInfer/LONGCOT-Refine-500K 2 months ago

[bot] Conversion to Parquet

#1 opened 2 months ago by

parquet-converter

Librarian Bot: Add language metadata for dataset

#2 opened 2 months ago by

librarian-bot

New activity in PowerInfer/SmallThinker-3B-Preview 2 months ago

Evaluation

#1 opened 2 months ago by

tugstugi

New activity in PowerInfer/TurboSparse-Mistral-Instruct 6 months ago

problems about sample strategies

#1 opened 6 months ago by

thuzhizhi

New activity in yixinsong/persona 7 months ago

[bot] Conversion to Parquet

#1 opened 7 months ago by

parquet-converter

New activity in BAAI/Infinity-Instruct 7 months ago

Are there plans to open-source the 0729 chat dataset?

#16 opened 7 months ago by

yixinsong

New activity in HuggingFaceTB/SmolLM-1.7B 8 months ago

MMLU doesn't match on lm-evaluation-harness

#2 opened 8 months ago by

yixinsong

New activity in SparseLLM/relu2-5B 9 months ago

Inference API not working properly. Lack of proper modeling file?

#1 opened 9 months ago by

xunkai55

New activity in SparseLLM/relu-5B 9 months ago

Difference between SparseLLM/relu and SparseLLM/reglu - lack of modeling file?

#1 opened 9 months ago by

xunkai55

commented on 3 papers 9 months ago