@Lunzima on HF中国镜像站: "I'm currently experimenting with the SFT dataset…"

HF中国镜像站

Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Back to feed

Lunzima

posted an update 2 days ago

Post

1188

I'm currently experimenting with the SFT dataset Lunzima/alpaca_like_dataset to further boost the performance of NQLSG-Qwen2.5-14B-MegaFusion-v9.x. This includes data sourced from DeepSeek-R1 or other cleaned results (excluding CoTs). Additionally, datasets that could potentially enhance the model's performance in math and programming/code, as well as those dedicated to specific uses like Swahili, are part of the mix.
@sometimesanotion @sthenno @wanlige

Lunzima

2 days ago

I don't know if the performance of Lunzima/NQLSG-Qwen2.5-14B-MegaFusion-v9.2 has improved or regressed because https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard/ is stuck.

In this post

Lunzima Lun Zima