Modern medical imaging increasingly relies on artificial intelligence to support detection, diagnosis, and prognostic ...
New research from FIU shows that some visual-language AI models have become particularly susceptible to image-based hacks.
This important work introduces an integrated open-source platform for behavioral acquisition and pose estimation that substantially improves the accessibility and speed of real-time animal tracking ...
Abstract: Unsupervised domain adaptive semantic segmentation (UDA-SS) for remote sensing imagery remains challenging due to the substantial distribution shifts across regions and sensors. Existing ...
microCLIP is a lightweight self-training framework that adapts CLIP for fine-grained image classification without requiring labeled data. While CLIP is strong in zero-shot transfer, it primarily ...
Abstract: Pre-trained vision-language models (VLMs) and language models (LMs) have recently garnered significant attention due to their remarkable ability to represent textual concepts, opening up new ...