HF中国镜像站

new

Get trending papers in your email inbox once a day!

Get trending papers in your email inbox!

Daily Papers

byAK and the research community

Mar 7

Submitted by

akhaliq

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

·
6 authors

Submitted by

akhaliq

SaulLM-7B: A pioneering Large Language Model for Law

·
11 authors

5

Submitted by

akhaliq

ShortGPT: Layers in Large Language Models are More Redundant Than You Expect

·
8 authors

Submitted by

akhaliq

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

·
10 authors

Submitted by

akhaliq

Learning to Decode Collaboratively with Multiple Language Models

·
5 authors

6

Submitted by

akhaliq

Enhancing Vision-Language Pre-training with Rich Supervisions

·
10 authors

Submitted by

akhaliq

Stop Regressing: Training Value Functions via Classification for Scalable Deep RL

·
12 authors

Submitted by

akhaliq

3D Diffusion Policy

·
6 authors

Submitted by

akhaliq

Caduceus: Bi-Directional Equivariant Long-Range DNA Sequence Modeling

·
6 authors

Submitted by

akhaliq

Backtracing: Retrieving the Cause of the Query

·
5 authors