A. Dosovitskiy, L. Beyer, A. Kolesnikov, D. Weissenborn, X. Zhai, T. Unterthiner,
M. Dehghani, M. Minderer, G. Heigold, S. Gelly, 2020, An image is worth 16x16 words:
Transformers for image recognition at scale, in International Conference on Learning
Representations (ICLR)
