Over 70 million people in the U.S. are impacted by hearing loss, and age-related hearing loss is the second most common ...
Abstract: The rise of deep-fake technology has sparked concerns as it blurs the distinction between fake media by harnessing Generative Adversarial Networks (GANs). This has raised issues surrounding ...
Artificial Intelligence (AI), especially deep learning, has significantly impacted audio and video signal processing. With large-scale multimodal datasets and enhanced computational resources, AI is ...
Abstract: Traffic surveillance is a key factor in ITS whereby accurate and real-time object detection assures improvement of road safety and traffic management. This paper advances a ...
We introduce MMAR, a new benchmark designed to evaluate the deep reasoning capabilities of Audio-Language Models (ALMs) across massive multi-disciplinary tasks. MMAR comprises 1,000 meticulously ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results