Home / Research Library / DreamBooth: Fine Tuning Text-to-Image Diffusion Mo...
🤖 Artificial Intelligence OpenAlex

DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation

📅 June 1, 2023 👤 Nataniel Ruiz, Yuanzhen Li, Varun Jampani et al. 📖 Research Journal 📊 1,940 citations

🤖 Plain-English Summary

Large text-to-image models achieved a remarkable leap in the evolution of AI, enabling high-quality and diverse synthesis of images from a given text prompt. We also provide a new dataset and evaluation protocol for this new task of subject-driven generation.

🔑 Key Findings

  • However, these models lack the ability to mimic the appearance of subjects in a given reference set and synthesize novel renditions of them in different contexts.
  • In this work, we present a new approach for “personalization” of text-to-image diffusion models.
  • Given as input just a few images of a subject, we fine-tune a pretrained text-to-image model such that it learns to bind a unique identifier with that specific subject.

💡 Why This Matters

This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.

Read the full paper
Access the original peer-reviewed research via OpenAlex.

View on DOI ↗

📋 Article Details

Category 🤖 Artificial Intelligence
Published Jun 01, 2023
Journal Research Journal
Authors Nataniel Ruiz, Yuanzhen Li, Varun Jampani, Yael Pritch, Michael Rubinstein
DOI 10.1109/cvpr52729.2023.02155
Citations 1,940
Source OpenAlex

More 🤖 Artificial Intelligence Research