-
CLEAR: Character Unlearning in Textual and Visual Modalities
Paper • 2410.18057 • Published • 203 -
CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation
Paper • 2410.23090 • Published • 54 -
What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective
Paper • 2410.23743 • Published • 62 -
"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization
Paper • 2411.02355 • Published • 49

Omar Elcircevi
omarcevi
·
AI & ML interests
None yet
Organizations
Collections
1
models
9

omarcevi/ppo-Pyramids_Training
Reinforcement Learning
•
Updated
•
20

omarcevi/ppo-SnowballTarget
Reinforcement Learning
•
Updated
•
33

omarcevi/Reinforce-Pixelcopter-PLE-v0
Reinforcement Learning
•
Updated

omarcevi/Reinforce-CartPole1
Reinforcement Learning
•
Updated

omarcevi/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
•
3

omarcevi/q-Taxi-V3
Reinforcement Learning
•
Updated

omarcevi/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated

omarcevi/ppo-Huggy
Reinforcement Learning
•
Updated
•
70

omarcevi/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
•
5
datasets
None public yet