Image-Gen
updated
Alleviating Distortion in Image Generation via Multi-Resolution
Diffusion Models
Paper
• 2406.09416
• Published
• 29
Wavelets Are All You Need for Autoregressive Image Generation
Paper
• 2406.19997
• Published
• 31
ViPer: Visual Personalization of Generative Models via Individual
Preference Learning
Paper
• 2407.17365
• Published
• 13
MegaFusion: Extend Diffusion Models towards Higher-resolution Image
Generation without Further Tuning
Paper
• 2408.11001
• Published
• 13
TraDiffusion: Trajectory-Based Training-Free Image Generation
Paper
• 2408.09739
• Published
• 9
Paper
• 2408.07009
• Published
• 62
LinFusion: 1 GPU, 1 Minute, 16K Image
Paper
• 2409.02097
• Published
• 34
Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think
Paper
• 2409.11355
• Published
• 30
MaskBit: Embedding-free Image Generation via Bit Tokens
Paper
• 2409.16211
• Published
• 17
ComfyGen: Prompt-Adaptive Workflows for Text-to-Image Generation
Paper
• 2410.01731
• Published
• 16
Fluid: Scaling Autoregressive Text-to-image Generative Models with
Continuous Tokens
Paper
• 2410.13863
• Published
• 37
HART: Efficient Visual Generation with Hybrid Autoregressive Transformer
Paper
• 2410.10812
• Published
• 18
BiGR: Harnessing Binary Latent Codes for Image Generation and Improved
Visual Representation Capabilities
Paper
• 2410.14672
• Published
• 8
PUMA: Empowering Unified MLLM with Multi-granular Visual Generation
Paper
• 2410.13861
• Published
• 56
Scalable Ranked Preference Optimization for Text-to-Image Generation
Paper
• 2410.18013
• Published
• 14
GrounDiT: Grounding Diffusion Transformers via Noisy Patch
Transplantation
Paper
• 2410.20474
• Published
• 14
Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion
Models
Paper
• 2411.05005
• Published
• 13
Edify Image: High-Quality Image Generation with Pixel Space Laplacian
Diffusion Models
Paper
• 2411.07126
• Published
• 30
Continuous Speculative Decoding for Autoregressive Image Generation
Paper
• 2411.11925
• Published
• 16
Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient
Paper
• 2411.17787
• Published
• 12
One Diffusion to Generate Them All
Paper
• 2411.16318
• Published
• 28
Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis
Paper
• 2412.01819
• Published
• 34
Steering Rectified Flow Models in the Vector Field for Controlled Image
Generation
Paper
• 2412.00100
• Published
• 17
Negative Token Merging: Image-based Adversarial Feature Guidance
Paper
• 2412.01339
• Published
• 22
Infinity: Scaling Bitwise AutoRegressive Modeling for High-Resolution
Image Synthesis
Paper
• 2412.04431
• Published
• 17
FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free
Scale Fusion
Paper
• 2412.09626
• Published
• 21
SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute
in Linear Diffusion Transformer
Paper
• 2501.18427
• Published
• 24
LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion
Transformer
Paper
• 2502.01105
• Published
• 21