For Content Creator
updated
Generative AI meets 3D: A Survey on Text-to-3D in AIGC Era
Paper
• 2305.06131
• Published
• 2
Perpetual Humanoid Control for Real-time Simulated Avatars
Paper
• 2305.06456
• Published
• 1
Drag Your GAN: Interactive Point-based Manipulation on the Generative
Image Manifold
Paper
• 2305.10973
• Published
• 39
LDM3D: Latent Diffusion Model for 3D
Paper
• 2305.10853
• Published
• 13
OpenShape: Scaling Up 3D Shape Representation Towards Open-World
Understanding
Paper
• 2305.10764
• Published
• 7
Chupa: Carving 3D Clothed Humans from Skinned Shape Priors using 2D
Diffusion Probabilistic Models
Paper
• 2305.11870
• Published
• 4
StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity
3D Avatar Generation
Paper
• 2305.19012
• Published
• 5
AlteredAvatar: Stylizing Dynamic 3D Avatars with Fast Style Adaptation
Paper
• 2305.19245
• Published
• 2
AniFaceDrawing: Anime Portrait Exploration during Your Sketching
Paper
• 2306.07476
• Published
• 19
AvatarBooth: High-Quality and Customizable 3D Human Avatar Generation
Paper
• 2306.09864
• Published
• 15
DragDiffusion: Harnessing Diffusion Models for Interactive Point-based
Image Editing
Paper
• 2306.14435
• Published
• 21
Magic123: One Image to High-Quality 3D Object Generation Using Both 2D
and 3D Diffusion Priors
Paper
• 2306.17843
• Published
• 44
SDXL: Improving Latent Diffusion Models for High-Resolution Image
Synthesis
Paper
• 2307.01952
• Published
• 90
DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models
Paper
• 2307.02421
• Published
• 35
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models
without Specific Tuning
Paper
• 2307.04725
• Published
• 65
ImageBrush: Learning Visual In-Context Instructions for Exemplar-Based
Image Manipulation
Paper
• 2308.00906
• Published
• 15
ConceptLab: Creative Generation using Diffusion Prior Constraints
Paper
• 2308.02669
• Published
• 25
IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image
Diffusion Models
Paper
• 2308.06721
• Published
• 36
ControlMat: A Controlled Generative Approach to Material Capture
Paper
• 2309.01700
• Published
• 17
Deep Geometrized Cartoon Line Inbetweening
Paper
• 2309.16643
• Published
• 26
Matryoshka Diffusion Models
Paper
• 2310.15111
• Published
• 45
FaceStudio: Put Your Face Everywhere in Seconds
Paper
• 2312.02663
• Published
• 32
X-Adapter: Adding Universal Compatibility of Plugins for Upgraded
Diffusion Model
Paper
• 2312.02238
• Published
• 27
AnimateZero: Video Diffusion Models are Zero-Shot Image Animators
Paper
• 2312.03793
• Published
• 18
VecFusion: Vector Font Generation with Diffusion
Paper
• 2312.10540
• Published
• 22
SDXL-Lightning: Progressive Adversarial Diffusion Distillation
Paper
• 2402.13929
• Published
• 27
Genie: Generative Interactive Environments
Paper
• 2402.15391
• Published
• 72
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with
Audio2Video Diffusion Model under Weak Conditions
Paper
• 2402.17485
• Published
• 194
Sora: A Review on Background, Technology, Limitations, and Opportunities
of Large Vision Models
Paper
• 2402.17177
• Published
• 88
Playground v2.5: Three Insights towards Enhancing Aesthetic Quality in
Text-to-Image Generation
Paper
• 2402.17245
• Published
• 11
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
Paper
• 2403.05135
• Published
• 45
AnimateDiff-Lightning: Cross-Model Diffusion Distillation
Paper
• 2403.12706
• Published
• 18
IDAdapter: Learning Mixed Features for Tuning-Free Personalization of
Text-to-Image Models
Paper
• 2403.13535
• Published
• 23
ThemeStation: Generating Theme-Aware 3D Assets from Few Exemplars
Paper
• 2403.15383
• Published
• 15
SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions
Paper
• 2403.16627
• Published
• 22
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
Paper
• 2403.17694
• Published
• 12
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image
Generation
Paper
• 2404.02733
• Published
• 22
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept
Matching
Paper
• 2404.03653
• Published
• 35
ControlNet++: Improving Conditional Controls with Efficient Consistency
Feedback
Paper
• 2404.07987
• Published
• 48
AniClipart: Clipart Animation with Text-to-Video Priors
Paper
• 2404.12347
• Published
• 13
Dynamic Typography: Bringing Words to Life
Paper
• 2404.11614
• Published
• 46
Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image
Synthesis
Paper
• 2404.13686
• Published
• 29
PuLID: Pure and Lightning ID Customization via Contrastive Alignment
Paper
• 2404.16022
• Published
• 25
InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation
Paper
• 2404.19427
• Published
• 74
Compositional Text-to-Image Generation with Dense Blob Representations
Paper
• 2405.08246
• Published
• 17
Toon3D: Seeing Cartoons from a New Perspective
Paper
• 2405.10320
• Published
• 22
FIFO-Diffusion: Generating Infinite Videos from Text without Training
Paper
• 2405.11473
• Published
• 56
3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian
Splatting
Paper
• 2405.18424
• Published
• 9
I4VGen: Image as Stepping Stone for Text-to-Video Generation
Paper
• 2406.02230
• Published
• 18
BitsFusion: 1.99 bits Weight Quantization of Diffusion Model
Paper
• 2406.04333
• Published
• 38
Step-aware Preference Optimization: Aligning Preference with Denoising
Performance at Each Step
Paper
• 2406.04314
• Published
• 30
Commonsense-T2I Challenge: Can Text-to-Image Generation Models
Understand Commonsense?
Paper
• 2406.07546
• Published
• 9
The Devil is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN
Inversion and High Quality Image Editing
Paper
• 2406.10601
• Published
• 70
Style-NeRF2NeRF: 3D Style Transfer From Style-Aligned Multi-View Images
Paper
• 2406.13393
• Published
• 5
Magic Insert: Style-Aware Drag-and-Drop
Paper
• 2407.02489
• Published
• 21
Paper
• 2407.14358
• Published
• 26
OutfitAnyone: Ultra-high Quality Virtual Try-On for Any Clothing and Any
Person
Paper
• 2407.16224
• Published
• 29
IPAdapter-Instruct: Resolving Ambiguity in Image-based Conditioning
using Instruct Prompts
Paper
• 2408.03209
• Published
• 22
Transformer Explainer: Interactive Learning of Text-Generative Models
Paper
• 2408.04619
• Published
• 175
Sketch2Scene: Automatic Generation of Interactive 3D Game Scenes from
User's Casual Sketches
Paper
• 2408.04567
• Published
• 26
ControlNeXt: Powerful and Efficient Control for Image and Video
Generation
Paper
• 2408.06070
• Published
• 55
UniPortrait: A Unified Framework for Identity-Preserving Single- and
Multi-Human Image Personalization
Paper
• 2408.05939
• Published
• 14
ZePo: Zero-Shot Portrait Stylization with Faster Sampling
Paper
• 2408.05492
• Published
• 7
CustomCrafter: Customized Video Generation with Preserving Motion and
Concept Composition Abilities
Paper
• 2408.13239
• Published
• 11
CSGO: Content-Style Composition in Text-to-Image Generation
Paper
• 2408.16766
• Published
• 18
LinFusion: 1 GPU, 1 Minute, 16K Image
Paper
• 2409.02097
• Published
• 34
IFAdapter: Instance Feature Control for Grounded Text-to-Image
Generation
Paper
• 2409.08240
• Published
• 22
InstantDrag: Improving Interactivity in Drag-based Image Editing
Paper
• 2409.08857
• Published
• 34
DrawingSpinUp: 3D Animation from Single Character Drawings
Paper
• 2409.08615
• Published
• 19
Click2Mask: Local Editing with Dynamic Mask Generation
Paper
• 2409.08272
• Published
• 5
Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think
Paper
• 2409.11355
• Published
• 30
LVCD: Reference-based Lineart Video Colorization with Diffusion Models
Paper
• 2409.12960
• Published
• 24
StoryMaker: Towards Holistic Consistent Characters in Text-to-image
Generation
Paper
• 2409.12576
• Published
• 16
MIMO: Controllable Character Video Synthesis with Spatial Decomposed
Modeling
Paper
• 2409.16160
• Published
• 34
Improvements to SDXL in NovelAI Diffusion V3
Paper
• 2409.15997
• Published
• 13
T2V-Turbo-v2: Enhancing Video Generation Model Post-Training through
Data, Reward, and Conditional Guidance Design
Paper
• 2410.05677
• Published
• 14
Story-Adapter: A Training-free Iterative Framework for Long Story
Visualization
Paper
• 2410.06244
• Published
• 20
TextToon: Real-Time Text Toonify Head Avatar from Single Video
Paper
• 2410.07160
• Published
• 7
DART: Denoising Autoregressive Transformer for Scalable Text-to-Image
Generation
Paper
• 2410.08159
• Published
• 26
Animate-X: Universal Character Image Animation with Enhanced Motion
Representation
Paper
• 2410.10306
• Published
• 56
MagicTailor: Component-Controllable Personalization in Text-to-Image
Diffusion Models
Paper
• 2410.13370
• Published
• 37