Abstract: Recent advancements in the domain of computer vision have enabled the analysis of audio spectrograms. In this paper, we present a novel approach that leverages spectrogram representations ...