We propose a simple pairwise sigmoid loss for imagetext pre-training. Finally, we push the batch size to the extreme, up to one million, and find that the benefits of growing batch size quickly diminish, with a more reasonable batch size of 32k being sufficient.
This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.
Read the full paper
Access the original peer-reviewed research via OpenAlex.
| Category | 🤖 Artificial Intelligence |
| Published | Oct 01, 2023 |
| Journal | Research Journal |
| Authors | Xiaohua Zhai, Basil Mustafa, А. И. Колесников, Lucas Beyer |
| DOI | 10.1109/iccv51070.2023.01100 |
| Citations | 614 |
| Source | OpenAlex |