FUfu99/Eurus-2-7B-PRIME-best_of_n-VLLM-Skywork-o1-Open-PRM-Qwen-2.5-7B-completions Updated 11 days ago • 27
FUfu99/Qwen2.5-7B-Instruct-best_of_n-VLLM-Skywork-o1-Open-PRM-Qwen-2.5-7B-completions Updated 11 days ago • 26
FUfu99/deepseek-math-7b-rl-best_of_n-VLLM-Skywork-o1-Open-PRM-Qwen-2.5-7B-completions Updated 18 days ago • 15
FUfu99/DeepSeek-R1-Distill-Qwen-1.5B-best_of_n-VLLM-Skywork-o1-Open-PRM-Qwen-2.5-7B-completions Updated 18 days ago • 127
FUfu99/DeepSeek-R1-Distill-Qwen-7B-best_of_n-VLLM-Skywork-o1-Open-PRM-Qwen-2.5-7B-completions Updated 18 days ago • 12
FUfu99/Qwen-2.5-Math-7B-SimpleRL-Zero-best_of_n-VLLM-Skywork-o1-Open-PRM-Qwen-2.5-7B-completions Updated 21 days ago • 48
FUfu99/Qwen-2.5-Math-7B-SimpleRL-best_of_n-VLLM-Skywork-o1-Open-PRM-Qwen-2.5-7B-completions Updated 21 days ago • 29
FUfu99/deepseek-math-7b-instruct-best_of_n-VLLM-Skywork-o1-Open-PRM-Qwen-2.5-7B-completions Updated 21 days ago • 16