A. Dosovitskiy, L. Beyer, A. Kolesnikov, D. Weissenborn, X. Zhai, T. Unterthiner,
M. Dehghani, M. Minderer, G. Heigold, S. Gelly, 2020, An image is worth 16x16 words:
Transformers for image recognition at scale, in International Conference on Learning
Representations (ICLR)
