HF中国镜像站

Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2402.14904

Papers I find interesting

Scaling Instruction-Finetuned Language Models

Paper • 2210.11416 • Published Oct 20, 2022 • 7
Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Paper • 2312.00752 • Published Dec 1, 2023 • 143
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Paper • 2403.05530 • Published Mar 8, 2024 • 64
Yi: Open Foundation Models by 01.AI

Paper • 2403.04652 • Published Mar 7, 2024 • 63

Watermarking Makes Language Models Radioactive

Paper • 2402.14904 • Published Feb 22, 2024 • 24

Watermarking Makes Language Models Radioactive

Paper • 2402.14904 • Published Feb 22, 2024 • 24

Watermarking Makes Language Models Radioactive

Paper • 2402.14904 • Published Feb 22, 2024 • 24
ChunkAttention: Efficient Self-Attention with Prefix-Aware KV Cache and Two-Phase Partition

Paper • 2402.15220 • Published Feb 23, 2024 • 21
GPTVQ: The Blessing of Dimensionality for LLM Quantization

Paper • 2402.15319 • Published Feb 23, 2024 • 21
DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation

Paper • 2402.11929 • Published Feb 19, 2024 • 11

Theoretic stuff that may be useful in DL application

Watermarking Makes Language Models Radioactive

Paper • 2402.14904 • Published Feb 22, 2024 • 24

Watermarking Makes Language Models Radioactive

Paper • 2402.14904 • Published Feb 22, 2024 • 24

Models - Watermarking

On the Reliability of Watermarks for Large Language Models

Paper • 2306.04634 • Published Jun 7, 2023 • 5
Watermarking Makes Language Models Radioactive

Paper • 2402.14904 • Published Feb 22, 2024 • 24

Efficient Tool Use with Chain-of-Abstraction Reasoning

Paper • 2401.17464 • Published Jan 30, 2024 • 20
Transforming and Combining Rewards for Aligning Large Language Models

Paper • 2402.00742 • Published Feb 1, 2024 • 12
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 107
Specialized Language Models with Cheap Inference from Limited Domain Data

Paper • 2402.01093 • Published Feb 2, 2024 • 46

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18, 2024 • 147
Orion-14B: Open-source Multilingual Large Language Models

Paper • 2401.12246 • Published Jan 20, 2024 • 13
MambaByte: Token-free Selective State Space Model

Paper • 2401.13660 • Published Jan 24, 2024 • 56
MM-LLMs: Recent Advances in MultiModal Large Language Models

Paper • 2401.13601 • Published Jan 24, 2024 • 47

bigscience/bloom

Text Generation • Updated Jul 28, 2023 • 457k • 4.86k
Runtime error

4

4

StableDiffusionBiasExplorer

🧑
Chain-of-Thought Reasoning Without Prompting

Paper • 2402.10200 • Published Feb 15, 2024 • 105
Watermarking Makes Language Models Radioactive

Paper • 2402.14904 • Published Feb 22, 2024 • 24

Previous
1
2
Next

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs