Sergei Averkiev

averoo

https://lingtra.in

averkij

AI & ML interests

None yet

averoo's activity

New activity in google/gemma-3-27b-it about 12 hours ago

Languages list

#16 opened about 12 hours ago by averoo

liked a model about 12 hours ago

google/gemma-3-27b-it

Image-Text-to-Text • Updated 1 day ago • 38.5k • 466

upvoted a collection about 12 hours ago

Gemma 3 Release

Collection

9 items • Updated 1 day ago • 210

upvoted a paper 1 day ago

AI-native Memory 2.0: Second Me

Paper • 2503.08102 • Published 3 days ago • 6

liked a Space 10 days ago

LLaDA

Space • Large Language Diffusion Models • 146

upvoted a paper 17 days ago

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published 21 days ago • 162

upvoted a paper 18 days ago

SurveyX: Academic Survey Automation via Large Language Models

Paper • 2502.14776 • Published 21 days ago • 92

upvoted 2 papers 21 days ago

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published 21 days ago • 179

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published 21 days ago • 97

upvoted a paper 27 days ago

CoSER: Coordinating LLM-Based Persona Simulation of Established Roles

Paper • 2502.09082 • Published 28 days ago • 28

liked a dataset 28 days ago

data-for-agents/insta-150k

Viewer • Updated 13 days ago • 147k • 677 • 5

liked a model 29 days ago

nomic-ai/nomic-embed-text-v2-moe

upvoted 3 papers about 1 month ago

Improving Transformer World Models for Data-Efficient RL

Paper • 2502.01591 • Published Feb 3 • 9

o3-mini vs DeepSeek-R1: Which One is Safer?

Paper • 2501.18438 • Published Jan 30 • 22

Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation

Paper • 2501.17433 • Published Jan 29 • 9

liked a model about 1 month ago

lingtrain/labse-udmurt

updated a model about 1 month ago

lingtrain/labse-udmurt

upvoted a paper about 1 month ago

Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling

Paper • 2501.16975 • Published Jan 28 • 26

liked a model about 1 month ago

baichuan-inc/Baichuan-Omni-1d5

Updated Feb 8 • 353 • 42

upvoted a paper about 1 month ago

Baichuan-Omni-1.5 Technical Report

Paper • 2501.15368 • Published Jan 26 • 61