Abstract: The advent of Vision Transformers (ViTs) has reshaped computer vision, delivering competitive performance across a wide range of visual recognition tasks.