Joao Pedro Silva Dias Moura Mesquita
inkasaras
AI & ML interests
None yet
Recent Activity
liked a model about 2 hours ago: open-r1/OlympicCoder-7B
upvoted an article about 2 hours ago: Open R1: Update #3
upvoted a paper about 10 hours ago: Learning from Failures in Multi-Attempt Reinforcement Learning
Organizations
None yet
Collections: 4
Models: 20
inkasaras/Mistral_ecommerce_v3 • Text Generation • Updated • 80 • 1
inkasaras/Mistral_ecommerce_v2 • Text Generation • Updated • 78
inkasaras/Mistral_ecommerce • Text Generation • Updated • 82 • 1
inkasaras/ppo-LunarLanding-v2 • Reinforcement Learning • Updated
inkasaras/rl_course_vizdoom_health_gathering_supreme • Reinforcement Learning • Updated
inkasaras/poca-SoccerTwosv2 • Reinforcement Learning • Updated • 26
inkasaras/poca-SoccerTwos • Updated
inkasaras/Pyramids • Reinforcement Learning • Updated • 28
inkasaras/ppo-SnowballTargetv2 • Reinforcement Learning • Updated • 25
inkasaras/ppo-SnowballTarget • Reinforcement Learning • Updated • 38