HF中国镜像站

new

Get trending papers in your email inbox once a day!

Get trending papers in your email inbox!

Daily Papers

byAK and the research community

Jun 17

Submitted by

Howuhh

XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning

·
6 authors

1

Submitted by

Royir

Make It Count: Text-to-Image Generation with an Accurate Number of Objects

·
6 authors

3

Submitted by

Cheng-YANG

ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation

·
14 authors

2

Submitted by

Weiyun1025

Needle In A Multimodal Haystack

·
16 authors

1

Submitted by

yurakuratov

BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack

·
7 authors

4

Submitted by

tellarin

SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages

·
61 authors

1

Submitted by

Weiyun1025

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

·
40 authors

3

Submitted by

wqshao126

GUI Odyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices

·
10 authors

1

Submitted by

GlyphByT5

Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Multilingual Visual Text Rendering

·
6 authors

Submitted by

davanstrien

GEB-1.3B: Open Lightweight Large Language Model

·
4 authors

3

Submitted by

akhaliq

Training-free Camera Control for Video Generation

·
4 authors

2

Submitted by

YidaChen

Designing a Dashboard for Transparency and Control of Conversational AI

·
12 authors

Submitted by

KevinQHLin

VideoGUI: A Benchmark for GUI Automation from Instructional Videos

·
8 authors

1

Submitted by

wqshao126

Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability,Reproducibility, and Practicality

·
12 authors

Submitted by

ahans1

Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs

·
11 authors

1

Submitted by

bing-li-ai

Vivid-ZOO: Multi-View Video Generation with Diffusion Model

·
7 authors

3

Submitted by

happy0612

AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis

·
5 authors

1

Submitted by

ankgoyal

RVT-2: Learning Precise Manipulation from Few Demonstrations

·
6 authors

1

Submitted by

deeptimhe

GaussianSR: 3D Gaussian Super-Resolution with 2D Diffusion Priors

·
4 authors

2

Submitted by

amanchadha

Decoding the Diversity: A Review of the Indic AI Research Landscape

·
5 authors

1

Submitted by

kargaranamir

MaskLID: Code-Switching Language Identification through Iterative Masking

·
3 authors

1