We present ControlNet, a neural network architecture to add spatial conditioning controls to large, pretrained text-to-image diffusion models. We show that the training of ControlNets is robust with small (<50k) and large (>1m) datasets.
This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.
Read the full paper
Access the original peer-reviewed research via OpenAlex.
| Category | 🤖 Artificial Intelligence |
| Published | Oct 01, 2023 |
| Journal | Research Journal |
| Authors | Lvmin Zhang, Anyi Rao, Maneesh Agrawala |
| DOI | 10.1109/iccv51070.2023.00355 |
| Citations | 3,568 |
| Source | OpenAlex |