FreeU: Free Lunch in Diffusion U-Net
Paper
• 2309.11497
• Published • 66
Imagic: Text-Based Real Image Editing with Diffusion Models
Paper
• 2210.09276
• Published • 1
On Architectural Compression of Text-to-Image Diffusion Models
Paper
• 2305.15798
• Published • 5
Wuerstchen: Efficient Pretraining of Text-to-Image Models
Paper
• 2306.00637
• Published • 13
CLIP-KD: An Empirical Study of Distilling CLIP Models
Paper
• 2307.12732
• Published
Online Clustered Codebook
Paper
• 2307.15139
• Published • 1
Residual Denoising Diffusion Models
Paper
• 2308.13712
• Published • 3
InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation
Paper
• 2309.06380
• Published • 33
Restart Sampling for Improving Generative Processes
Paper
• 2306.14878
• Published • 5
Controlling Text-to-Image Diffusion by Orthogonal Finetuning
Paper
• 2306.07280
• Published • 25
Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack
Paper
• 2309.15807
• Published • 33
Finite Scalar Quantization: VQ-VAE Made Simple
Paper
• 2309.15505
• Published • 24
Muse: Text-To-Image Generation via Masked Generative Transformers
Paper
• 2301.00704
• Published
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Paper
• 2310.00426
• Published • 61
DPM-Solver-v3: Improved Diffusion ODE Solver with Empirical Model Statistics
Paper
• 2310.13268
• Published • 18
CommonCanvas: An Open Diffusion Model Trained with Creative-Commons Images
Paper
• 2310.16825
• Published • 36
Image Super-resolution Via Latent Diffusion: A Sampling-space Mixture Of Experts And Frequency-augmented Decoder Approach
Paper
• 2310.12004
• Published • 2
SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding
Paper
• 2310.15308
• Published • 23
Matryoshka Diffusion Models
Paper
• 2310.15111
• Published • 45
Beyond U: Making Diffusion Models Faster & Lighter
Paper
• 2310.20092
• Published • 12
I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models
Paper
• 2311.04145
• Published • 34
Soft Mixture Denoising: Beyond the Expressive Bottleneck of Diffusion Models
Paper
• 2309.14068
• Published • 1
Denoising Diffusion Step-aware Models
Paper
• 2310.03337
• Published • 1
DiffNAS: Bootstrapping Diffusion Models by Prompting for Better Architectures
Paper
• 2310.04750
• Published • 1
PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models
Paper
• 2401.05252
• Published • 49
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs
Paper
• 2401.11708
• Published • 30
UNIMO-G: Unified Image Generation through Multimodal Conditional Diffusion
Paper
• 2401.13388
• Published • 13
Deconstructing Denoising Diffusion Models for Self-Supervised Learning
Paper
• 2401.14404
• Published • 18
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
Paper
• 2404.03653
• Published • 35