Home / Research Library / Palette: Image-to-Image Diffusion Models
🤖 Artificial Intelligence OpenAlex

Palette: Image-to-Image Diffusion Models

📅 July 20, 2022 👤 Chitwan Saharia, William Chan, Huiwen Chang et al. 📖 Research Journal 📊 1,495 citations

🤖 Plain-English Summary

This paper develops a unified framework for image-to-image translation based on conditional diffusion models and evaluates this framework on four challenging image-to-image translation tasks, namely colorization, inpainting, uncropping, and JPEG restoration. Finally, we show that a generalist, multi-task diffusion model performs as well or better than task-specific specialist counterparts.

🔑 Key Findings

  • Our simple implementation of image-to-image diffusion models outperforms strong GAN and regression baselines on all tasks, without task-specific hyper-parameter tuning, architecture customization, or any auxiliary loss or sophisticated new techniques needed.
  • We uncover the impact of an L2 vs.
  • L1 loss in the denoising diffusion objective on sample diversity, and demonstrate the importance of self-attention in the neural architecture through empirical studies.

💡 Why This Matters

This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.

Read the full paper
Access the original peer-reviewed research via OpenAlex.

View on DOI ↗

📋 Article Details

Category 🤖 Artificial Intelligence
Published Jul 20, 2022
Journal Research Journal
Authors Chitwan Saharia, William Chan, Huiwen Chang, Chris Lee, Jonathan Ho
DOI 10.1145/3528233.3530757
Citations 1,495
Source OpenAlex

More 🤖 Artificial Intelligence Research