9 98 18

Dhruv Diddi

ddiddi

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

LocAgent: Graph-Guided LLM Agents for Code Localization

upvoted a paper 1 day ago

Benchmarking AI Models in Software Engineering: A Review, Search Tool, and Enhancement Protocol

upvoted a paper 1 day ago

AnyMoLe: Any Character Motion In-betweening Leveraging Video Diffusion Models

View all activity

Organizations

ddiddi's activity

upvoted 7 papers 1 day ago

LocAgent: Graph-Guided LLM Agents for Code Localization

Paper • 2503.09089 • Published 2 days ago • 5

Benchmarking AI Models in Software Engineering: A Review, Search Tool, and Enhancement Protocol

Paper • 2503.05860 • Published 7 days ago • 7

AnyMoLe: Any Character Motion In-betweening Leveraging Video Diffusion Models

Paper • 2503.08417 • Published 3 days ago • 6

"Principal Components" Enable A New Language of Images

Paper • 2503.08685 • Published 3 days ago • 10

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

Paper • 2503.07920 • Published 3 days ago • 89

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

Paper • 2503.07572 • Published 4 days ago • 31

LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL

Paper • 2503.07536 • Published 4 days ago • 73

liked a model 8 days ago

bartowski/Qwen_QwQ-32B-GGUF

Text Generation • Updated 9 days ago • 172k • 142

liked a model 9 days ago

Qwen/QwQ-32B

Text Generation • Updated 3 days ago • 296k • • 2.16k

published a Space 9 days ago

Solo Qwen QwQ 32B

💬

liked a Space 17 days ago

Llasa 1b Multilingual TTS

🌍

Generate speech from text with or without cloning a voice

reacted to JingzeShi's post with 🚀 19 days ago

Post

2944

🤗Welcome to the Doge Edge Device Small language Model.

SmallDoge/Doge-160M-Instruct

upvoted 2 papers about 1 month ago

MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding

Paper • 2501.18362 • Published Jan 30 • 21

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 111

liked a model about 1 month ago

bartowski/Mistral-Small-24B-Instruct-2501-GGUF

Text Generation • Updated Jan 30 • 45.3k • 108

upvoted 4 papers about 2 months ago

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Paper • 2501.13106 • Published Jan 22 • 85

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 346

GPS as a Control Signal for Image Generation

Paper • 2501.12390 • Published Jan 21 • 12

Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback

Paper • 2501.12895 • Published Jan 22 • 57

liked a Space about 2 months ago

546

DeepSeek-R1 WebGPU

🧠

Next-generation reasoning model that runs locally in-browser