Full-text search · 1,000+ results
hustvl / lightningdit-xl-imagenet256-64ep
README.md · model · 1 match

changwh5 / Stylegan2-ada
README.md · model · 1 match
BooBooWu / Vec2Face
README.md · model · 1 match
tags: unconditional-image-generation, arxiv:2409.02979, license:mit, region:us
# Vec2Face Model Card
[**Project Page**](https://haiyuwu.github.io/vec2face.github.io/) **|** [**Paper**](https://arxiv.org/abs/2409.02979) **|** [**Code**](https://github.com/HaiyuWu/vec2face) **|** [🤗 **Gradio demo**](https://huggingface.co/spaces/BooBooWu/Vec2Face)

mit-han-lab / hart-0.7b-1024px
README.md · model · 1 match
tags: unconditional-image-generation, arxiv:2410.10812, license:mit, region:us
# HART: Efficient Visual Generation with Hybrid Autoregressive Transformer
## Abstract
We introduce Hybrid Autoregressive Transformer (HART), an autoregressive (AR) visual generation model capable of directly generating 1024x1024 images, rivaling diffusion models in image generation quality. Existing AR models face limitations due to the poor image reconstruction quality of their discrete tokenizers and the prohibitive training costs associated with generating 1024px images. To address these challenges, we present the hybrid tokenizer, which decomposes the continuous latents from the autoencoder into two components: discrete tokens representing the big picture and continuous tokens representing the residual components that cannot be represented by the discrete tokens. The discrete component is modeled by a scalable-resolution discrete AR model, while the continuous component is learned with a lightweight residual diffusion module with only 37M parameters. Compared with the discrete-only VAR tokenizer, our hybrid approach improves reconstruction FID from 2.11 to 0.30 on MJHQ-30K, leading to a 31% generation FID improvement from 7.85 to 5.38. HART also outperforms state-of-the-art diffusion models in both FID and CLIP score, with 4.5-7.7x higher throughput and 6.9-13.4x lower MACs.
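
To make the decomposition concrete, here is a minimal sketch of the hybrid tokenization idea (hypothetical names and shapes, not the released mit-han-lab/hart code): each continuous latent is vector-quantized against a codebook for the discrete component, and the quantization residual becomes the continuous component.

```python
# Illustrative sketch of hybrid tokenization (hypothetical names, not the
# actual HART implementation): vector-quantize the autoencoder latents for
# the "big picture" and keep the residual as the continuous component.
import torch

def hybrid_tokenize(z: torch.Tensor, codebook: torch.Tensor):
    """z: (n, d) continuous latents; codebook: (k, d) code vectors."""
    dists = torch.cdist(z, codebook)   # (n, k) pairwise L2 distances
    ids = dists.argmin(dim=1)          # discrete token ids, shape (n,)
    z_discrete = codebook[ids]         # quantized "big picture" latents
    residual = z - z_discrete          # continuous part; in HART this is
    return ids, residual               # modeled by a small diffusion module

def reconstruct(ids, residual, codebook):
    # Summing both components recovers the latent exactly when the
    # residual is modeled perfectly.
    return codebook[ids] + residual

# Toy usage
z = torch.randn(16, 32)
codebook = torch.randn(512, 32)
ids, res = hybrid_tokenize(z, codebook)
assert torch.allclose(reconstruct(ids, res, codebook), z, atol=1e-6)
```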

Gusanagy / UDBE-Unsupervised-Diffusion-based-Brightness-Enhancement-in-Underwater-Images
model · 1 match

brownvc / R3GAN-ImgNet-64x64
README.md · model · 1 match
tags: unconditional-image-generation, dataset:ILSVRC/imagenet-1k, arxiv:2501.05441, region:us
# Model Card for ImageNet 64x64 R3GAN Model
This model card provides details about the R3GAN model, trained on the ImageNet dataset and introduced in the NeurIPS 2024 R3GAN paper: https://arxiv.org/abs/2501.05441
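
A checkpoint like this can be fetched with `huggingface_hub`; a minimal sketch follows, assuming a placeholder filename (check the repository's file listing for the real one):

```python
# Minimal download sketch using huggingface_hub.
# "checkpoint.pkl" is a placeholder, not necessarily the actual file
# in this repository; consult the repo's file listing.
from huggingface_hub import hf_hub_download

ckpt_path = hf_hub_download(
    repo_id="brownvc/R3GAN-ImgNet-64x64",
    filename="checkpoint.pkl",  # placeholder filename
)
print(ckpt_path)  # local path to the cached checkpoint
```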
## Model Details

brownvc / R3GAN-FFHQ-256x256
README.md · model · 1 match

brownvc / R3GAN-FFHQ-64x64
README.md · model · 1 match

brownvc / R3GAN-CIFAR10
README.md · model · 1 match
tags: unconditional-image-generation, dataset:uoft-cs/cifar10, arxiv:2501.05441, region:us
# Model Card for CIFAR10 R3GAN Model
This model card provides details about the R3GAN model, trained on the CIFAR10 dataset and introduced in the NeurIPS 2024 R3GAN paper: https://arxiv.org/abs/2501.05441
## Model Details

brownvc / R3GAN-ImgNet-32x32
README.md · model · 1 match
tags: unconditional-image-generation, dataset:ILSVRC/imagenet-1k, arxiv:2501.05441, region:us
# Model Card for ImageNet 32x32 R3GAN Model
This model card provides details about the R3GAN model, trained on the ImageNet dataset and introduced in the NeurIPS 2024 R3GAN paper: https://arxiv.org/abs/2501.05441
## Model Details

Epiphqny / PAR
README.md · model · 1 match
tags: unconditional-image-generation, arxiv:2412.15119, region:us
# Parallelized Autoregressive Visual Generation
Autoregressive models have emerged as a powerful approach for visual generation but suffer from slow inference speed due to their sequential token-by-token prediction process. In this paper, we propose a simple yet effective approach for parallelized autoregressive visual generation that improves generation efficiency while preserving the advantages of autoregressive modeling. Our key insight is that parallel generation depends on visual token dependencies—tokens with weak dependencies can be generated in parallel, while strongly dependent adjacent tokens are difficult to generate together, as their independent sampling may lead to inconsistencies. Based on this observation, we develop a parallel generation strategy that generates distant tokens with weak dependencies in parallel while maintaining sequential generation for strongly dependent local tokens. Our approach can be seamlessly integrated into standard autoregressive models without modifying the architecture or tokenizer. Experiments on ImageNet and UCF-101 demonstrate that our method achieves a 3.6× speedup with comparable quality and up to 9.5× speedup with minimal quality degradation across both image and video generation tasks.
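
The scheduling idea can be illustrated with a toy sketch (a hypothetical simplification, not the released PAR code): split the token sequence into regions, generate each region's first token sequentially to fix global structure, then sample corresponding positions across regions in parallel.

```python
# Toy sketch of a parallel AR generation schedule (hypothetical
# simplification, not the actual PAR implementation). Tokens are split
# into regions; the first token of each region is generated sequentially,
# then corresponding positions across regions are sampled in parallel,
# since distant tokens depend on each other only weakly.
def build_schedule(num_tokens: int, num_regions: int):
    """Return a list of steps; each step lists token indices sampled together."""
    region_size = num_tokens // num_regions
    regions = [list(range(r * region_size, (r + 1) * region_size))
               for r in range(num_regions)]
    schedule = []
    # Sequential phase: fix the global layout one region-initial token at a time.
    for region in regions:
        schedule.append([region[0]])
    # Parallel phase: one token from every region per step.
    for i in range(1, region_size):
        schedule.append([region[i] for region in regions])
    return schedule

# 16 tokens in 4 regions: 4 sequential + 3 parallel steps = 7 steps instead of 16.
print(build_schedule(16, 4))
```
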
Abhiroop174 / galaxy_gen
README.md · model · 1 match

skytnt / fbanime-gan
README.md · model · 1 match

cmudrc / 2d-lattice-decoder
model · 1 match

utsavnandi / fashion-mnist-ddpm-32px-5000_steps
README.md · model · 1 match

changwh5 / BigBiGAN-MNIST-150epoch
README.md · model · 1 match
tags: code, unconditional-image-generation, zh, dataset:mnist, region:us
# Model Card for Model ID
<!-- Provide a quick summary of what the model is/does. -->
This model card is a base template for new models, generated from the Hugging Face model card template: https://github.com/huggingface/huggingface_hub/blob/main/src/huggingface_hub/templates/modelcard_template.md?plain=1