We present the vector quantized diffusion (VQ-Diffusion) model for text-to-image generation. Our experiments indicate that the VQ-Diffusion model with the reparameterization is fifteen times faster than traditional AR methods while achieving a better image quality.
This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.
Read the full paper
Access the original peer-reviewed research via OpenAlex.
| Category | 🤖 Artificial Intelligence |
| Published | Jun 01, 2022 |
| Journal | 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) |
| Authors | Shuyang Gu, Dong Chen, Jianmin Bao, Fang Wen, Bo Zhang |
| DOI | 10.1109/cvpr52688.2022.01043 |
| Citations | 623 |
| Source | OpenAlex |