Submitted by akhaliq 70 Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation · 7 authors 3
Submitted by akhaliq 28 Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning · 4 authors 2
Submitted by akhaliq 23 Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis · 7 authors 5
Submitted by akhaliq 19 VALL-E 2: Neural Codec Language Models are Human Parity Zero-Shot Text to Speech Synthesizers · 9 authors
Submitted by akhaliq 16 Margin-aware Preference Optimization for Aligning Diffusion Models without Reference · 6 authors 1
Submitted by akhaliq 16 ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization · 9 authors
Submitted by akhaliq 13 MLCM: Multistep Consistency Distillation of Latent Diffusion Model · 6 authors
Submitted by akhaliq 12 ExtraNeRF: Visibility-Aware View Extrapolation of Neural Radiance Fields with Diffusion Models · 6 authors
Submitted by akhaliq 12 GTR: Improving Large 3D Reconstruction Models through Geometry and Texture Refinement · 10 authors