6 150 57

rotem israeli

irotem98

https://rotem154154.github.io

rotem154154

AI & ML interests

None yet

Recent Activity

upvoted a paper about 22 hours ago

Reangle-A-Video: 4D Video Generation as Video-to-Video Translation

upvoted a paper about 22 hours ago

TPDiff: Temporal Pyramid Video Diffusion Model

upvoted a paper 2 days ago

YuE: Scaling Open Foundation Models for Long-Form Music Generation

View all activity

Organizations

None yet

irotem98's activity

upvoted 2 papers about 22 hours ago

Reangle-A-Video: 4D Video Generation as Video-to-Video Translation

Paper • 2503.09151 • Published 2 days ago • 26

TPDiff: Temporal Pyramid Video Diffusion Model

Paper • 2503.09566 • Published 1 day ago • 37

upvoted a paper 2 days ago

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Paper • 2503.08638 • Published 3 days ago • 54

upvoted 2 papers 3 days ago

LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning

Paper • 2503.04812 • Published 10 days ago • 12

EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer

Paper • 2503.07027 • Published 4 days ago • 23

upvoted a paper 5 days ago

Token-Efficient Long Video Understanding for Multimodal LLMs

Paper • 2503.04130 • Published 8 days ago • 79

upvoted a paper 14 days ago

UniTok: A Unified Tokenizer for Visual Generation and Understanding

Paper • 2502.20321 • Published 15 days ago • 29

upvoted a paper 17 days ago

DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks

Paper • 2502.17157 • Published 18 days ago • 51

upvoted a paper 19 days ago

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published 22 days ago • 179

upvoted a paper 21 days ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published 22 days ago • 129

upvoted a paper about 1 month ago

Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling

Paper • 2501.16975 • Published Jan 28 • 26

upvoted a paper about 2 months ago

Diffusion Adversarial Post-Training for One-Step Video Generation

Paper • 2501.08316 • Published Jan 14 • 33

upvoted a paper 2 months ago

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published Jan 9 • 88

upvoted 7 papers 3 months ago

Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching

Paper • 2412.17153 • Published Dec 22, 2024 • 34

Large Motion Video Autoencoding with Cross-modal Video VAE

Paper • 2412.17805 • Published Dec 23, 2024 • 24

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 352

FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion

Paper • 2412.09626 • Published Dec 12, 2024 • 20

GenEx: Generating an Explorable World

Paper • 2412.09624 • Published Dec 12, 2024 • 90

SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training

Paper • 2412.09619 • Published Dec 12, 2024 • 26

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 111