CheXagent and its "byproducts" Collection 📝 Paper: https://arxiv.org/abs/2401.12208 🧩 GitHub: https://github.com/Stanford-AIMI/CheXagent • 10 items • Updated Jan 12 • 2
Scaling Diffusion Language Models via Adaptation from Autoregressive Models Paper • 2410.17891 • Published Oct 23, 2024 • 16
WorldSimBench: Towards Video Generation Models as World Simulators Paper • 2410.18072 • Published Oct 23, 2024 • 20
MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models Paper • 2410.17637 • Published Oct 23, 2024 • 35
Scalable Ranked Preference Optimization for Text-to-Image Generation Paper • 2410.18013 • Published Oct 23, 2024 • 15
DynamicCity: Large-Scale LiDAR Generation from Dynamic Scenes Paper • 2410.18084 • Published Oct 23, 2024 • 14
LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding Paper • 2410.17434 • Published Oct 22, 2024 • 28
ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding Paper • 2410.13924 • Published Oct 17, 2024 • 7
TP-Eval: Tap Multimodal LLMs' Potential in Evaluation by Customizing Prompts Paper • 2410.18071 • Published Oct 23, 2024 • 7
M-RewardBench: Evaluating Reward Models in Multilingual Settings Paper • 2410.15522 • Published Oct 20, 2024 • 12
ZePo: Zero-Shot Portrait Stylization with Faster Sampling Paper • 2408.05492 • Published Aug 10, 2024 • 7
DinoBloom: A Foundation Model for Generalizable Cell Embeddings in Hematology Paper • 2404.05022 • Published Apr 7, 2024 • 2
BIMCV-R: A Landmark Dataset for 3D CT Text-Image Retrieval Paper • 2403.15992 • Published Mar 24, 2024 • 1
TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles Paper • 2410.05262 • Published Oct 7, 2024 • 10