To be clear: we do not use VLMs to drive the robot. Using a heavy cloud model to steer in real time would ...
Abstract: Remote sensing image semantic segmentation is a fundamental task in geospatial information analysis, aiming to assign meaningful semantic labels to every pixel in high-resolution satellite ...