Reinforce learning 🔃 - a thehandsomefrog4825 Collection

HF中国镜像站

thehandsomefrog4825 's Collections

Object detection 🔍

VLM 👁️👁️

Object segmentation 🧩

Reinforce learning 🔃

GAN

Robotic 🤖🔧

TTI ⌨️➡️🖼️

TTS ⌨️➡️🗣️

TTV 📝➡️📺

Generative 🎨

Reinforce learning 🔃

updated Feb 9

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4 • 92
Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation

Paper • 2412.06531 • Published Dec 9, 2024 • 71
The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published Feb 3 • 112
Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3 • 55