Page Object Model in Selenium with Java

Mitigating Object Hallucination in Large Vision-Language Models via Visual Attention Direct Preference Optimization

Abstract: Large Vision-Language Models (LVLMs) suffer from severe object hallucinations, leading them to frequently generate outputs that do not correspond to the image content, significantly reducing ...

IEEE

EasyHOI: Unleashing the Power of Large Models for Reconstructing Hand-Object Interactions in the Wild

Abstract: Our work aims to reconstruct hand-object interactions from a single-view image, which is a fundamental but ill-posed task. Unlike methods that reconstruct from videos, multiview images, or ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Mitigating Object Hallucination in Large Vision-Language Models via Visual Attention Direct Preference Optimization

EasyHOI: Unleashing the Power of Large Models for Reconstructing Hand-Object Interactions in the Wild

Trending now