Home / Research Library / T2I-Adapter: Learning Adapters to Dig Out More Con...
🤖 Artificial Intelligence OpenAlex

T2I-Adapter: Learning Adapters to Dig Out More Controllable Ability for Text-to-Image Diffusion Models

📅 March 24, 2024 👤 Chong Mou, Xintao Wang, Liangbin Xie et al. 📖 Proceedings of the AAAI Conference on Artificial Intelligence 📊 711 citations

🤖 Plain-English Summary

The incredible generative ability of large-scale text-to-image (T2I) models has demonstrated strong power of learning complex structures and meaningful semantics. Extensive experiments demonstrate that our T2I-Adapter has promising generation quality and a wide range of applications.

🔑 Key Findings

  • However, relying solely on text prompts cannot fully take advantage of the knowledge learned by the model, especially when flexible and accurate controlling (e.g., structure and color) is needed.
  • In this paper, we aim to ``dig out" the capabilities that T2I models have implicitly learned, and then explicitly use them to control the generation more granularly.
  • Specifically, we propose to learn low-cost T2I-Adapters to align internal knowledge in T2I models with external control signals, while freezing the original large T2I models.

💡 Why This Matters

This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.

Read the full paper
Access the original peer-reviewed research via OpenAlex.

View on DOI ↗

📋 Article Details

Category 🤖 Artificial Intelligence
Published Mar 24, 2024
Journal Proceedings of the AAAI Conference on Artificial Intelligence
Authors Chong Mou, Xintao Wang, Liangbin Xie, Yanze Wu, Jian Zhang
DOI 10.1609/aaai.v38i5.28226
Citations 711
Source OpenAlex

More 🤖 Artificial Intelligence Research