Vision transformers have shown great success due to their high model capabilities. Compared to the recent efficient model MobileViT-XXS, EfficientViT-M2 achieves 1.8% superior accuracy, while running <tex xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">$5.8\times/3.7\times$</tex> faster on the GPU/CPU, and <tex xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">$7.4\times faster$</tex> when converted to ONNX format.
This work deepens our understanding of the fundamental laws governing the universe, from subatomic particles to cosmic structures.
Read the full paper
Access the original peer-reviewed research via OpenAlex.
| Category | ⚛️ Physics & Space Science |
| Published | Jun 01, 2023 |
| Journal | Research Journal |
| Authors | Xinyu Liu, Houwen Peng, Ningxin Zheng, Yuqing Yang, Han Hu |
| DOI | 10.1109/cvpr52729.2023.01386 |
| Citations | 729 |
| Source | OpenAlex |