24 51 189

Théo Gigant

gigant

https://giganttheo.github.io/

AI & ML interests

multimodal

Recent Activity

upvoted an article 1 day ago

Open R1: Update #3

upvoted an article 1 day ago

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

liked a model 2 days ago

HuggingFaceM4/idefics2-8b-chatty

View all activity

Organizations

gigant's activity

upvoted 2 articles 1 day ago

Article

Open R1: Update #3

and 9 others •

2 days ago

• 190

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

2 days ago

• 197

upvoted a paper 3 days ago

EuroBERT: Scaling Multilingual Encoders for European Languages

Paper • 2503.05500 • Published 6 days ago • 71

upvoted an article 3 days ago

Article

Introducing EuroBERT: A High-Performance Multilingual Encoder Model

and 3 others •

4 days ago

• 115

upvoted an article 9 days ago

Article

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

10 days ago

• 65

upvoted a paper 20 days ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published 22 days ago • 162

upvoted an article 20 days ago

Article

SigLIP 2: A better multilingual vision language encoder

21 days ago

• 133

upvoted a paper 28 days ago

Ignore the KL Penalty! Boosting Exploration on Critical Tokens to Enhance RL Fine-Tuning

Paper • 2502.06533 • Published Feb 10 • 18

upvoted a paper about 1 month ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 203

upvoted an article about 1 month ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 803

upvoted an article 3 months ago

Article

EuroLLM-9B

and 5 others •

Dec 2, 2024

• 113

upvoted a paper 6 months ago

EuroLLM: Multilingual Language Models for Europe

Paper • 2409.16235 • Published Sep 24, 2024 • 26

upvoted 4 papers 7 months ago

Contextual Position Encoding: Learning to Count What's Important

Paper • 2405.18719 • Published May 29, 2024 • 5

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22, 2024 • 126

MiniCPM-V: A GPT-4V Level MLLM on Your Phone

Paper • 2408.01800 • Published Aug 3, 2024 • 82

Harvesting Textual and Structured Data from the HAL Publication Repository

Paper • 2407.20595 • Published Jul 30, 2024 • 22

upvoted a paper 8 months ago

MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens

Paper • 2406.11271 • Published Jun 17, 2024 • 21

upvoted 3 articles 8 months ago

Article

Docmatix - a huge dataset for Document Visual Question Answering

Jul 18, 2024

• 72

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024

• 332

Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

•

Jul 5, 2024

• 214