Abstract: Existing high-performance semantic segmentation methods for remote sensing often struggle with the tradeoff between inference efficiency and high-resolution processing, especially in ...
Abstract: In this paper, we introduce a simple but effective training-free pipeline for text-to-video object segmentation. Our approach leverages open-source Multimodal Large ...
Abstract: Skeleton-based Temporal Action Segmentation (STAS) aims to densely parse untrimmed skeletal sequences into frame-level action categories. However, existing methods, while proficient at ...
This repository provides the codebase of SSP-SAM, a referring expression segmentation framework built on top of SAM with semantic-spatial prompts.