[ICCV 2023 & TPAMI 2025] MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions
video-understanding multimodal-learning referring-expression-segmentation referring-expression-comprehension referring-video-object-segmentation mose-dataset mevis-dataset
-
Updated
Jan 8, 2026 - Python