Instruction tuning large language models (LLMs) using machine-generated instruction-following data has improved zero-shot capabilities on new tasks, but the idea is less explored in the multimodal field. When fine-tuned on Science QA, the synergy of LLaVA and GPT-4 achieves a new advanced accuracy of 92.53%.
This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.
Read the full paper
Access the original peer-reviewed research via OpenAlex.
| Category | 🤖 Artificial Intelligence |
| Published | Apr 17, 2023 |
| Journal | arXiv (Cornell University) |
| Authors | Haotian Liu, Chunyuan Li, Qingyang Wu, Yong Jae Lee |
| DOI | 10.48550/arxiv.2304.08485 |
| Citations | 679 |
| Source | OpenAlex |