[TPAMI 2023] Local-Global Context Aware Transformer for Language-Guided Video Segmentation
☆47Jan 20, 2024Updated 2 years ago
Alternatives and similar repositories for Locater
Users that are interested in Locater are comparing it to the libraries listed below
Sorting:
- CVPR2022 - Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation☆24Aug 12, 2022Updated 3 years ago
- Refer-Youtube-VOS dataset☆27Mar 10, 2026Updated last week
- [CVPR2022] Official Implementation of ReferFormer☆352Feb 15, 2025Updated last year
- Wnet: Audio-Guided Video Object Segmentation via Wavelet-Based Cross-Modal Denoising Networks☆24Sep 6, 2022Updated 3 years ago
- ☆17Jun 21, 2022Updated 3 years ago
- RefVOS☆28Feb 3, 2021Updated 5 years ago
- Repository of our accepted CVPR2022 paper "Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-La…☆28Mar 4, 2022Updated 4 years ago
- Official Pytorch implementation of "Visual Recognition with Deep Nearest Centroids". (ICLR2023 Spotlight)☆69Feb 1, 2023Updated 3 years ago
- This is the code for CVPR2022 paper "Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation"☆19Feb 19, 2023Updated 3 years ago
- This is the official implementation of "Transferring to Real-World Layouts: A Depth-aware Framework for Scene Adaptation" (Accepted at AC…☆14Aug 24, 2024Updated last year
- Code for Referring Image Segmentation via Cross-Modal Progressive Comprehension, CVPR2020.☆63Feb 2, 2021Updated 5 years ago
- [CVPR 2022] Visual Abductive Reasoning☆124Oct 22, 2024Updated last year
- Repository of our CVPR2023 paper "Lana: A Language-Capable Navigator for Instruction Following and Generation"☆94Apr 27, 2023Updated 2 years ago
- [ECCV24] Navigation Instruction Generation with BEV Perception and Large Language Models☆31Jul 16, 2024Updated last year
- Video Object Segmentation with Episodic Graph Memory Networks (ECCV2020 spotlight)☆92Sep 10, 2020Updated 5 years ago
- [ICCV 2023] Official code release of our paper "Referring Image Segmentation Using Text Supervision"☆73Oct 13, 2024Updated last year
- CVPR2022 (Oral) - Rethinking Semantic Segmentation: A Prototype View☆389Jun 30, 2022Updated 3 years ago
- ☆99Sep 5, 2023Updated 2 years ago
- 【CVPR'24】OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition☆37Apr 27, 2024Updated last year
- ☆20May 11, 2025Updated 10 months ago
- [ECCV2024] Nonverbal Interaction Detection☆29Oct 30, 2024Updated last year
- Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023]☆30Mar 13, 2024Updated 2 years ago
- [CVPR'24] Neural Clustering based Visual Representation Learning☆44Oct 6, 2025Updated 5 months ago
- RVOS: End-to-End Recurrent Network for Video Object Segmentation (CVPR 2019)☆277Nov 22, 2022Updated 3 years ago
- [TIP 2023] Co-Learning Meets Stitch-Up for Noisy Multi-label Visual Recognition.☆13Aug 19, 2023Updated 2 years ago
- Implementation for "DeltaPhi: Learning Physical Trajectory Residual for PDE Solving"☆13Jun 17, 2024Updated last year
- [ICCV 2023] OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation☆58Oct 7, 2023Updated 2 years ago
- Calculation of the entropy of the batch of images (whole image or patches)☆10Oct 15, 2021Updated 4 years ago
- The repository of ECCV 2020 paper `Active Visual Information Gathering for Vision-Language Navigation`☆44Apr 9, 2022Updated 3 years ago
- Official code for "Opening up Open World Tracking" (CVPR 2022)☆55Apr 8, 2023Updated 2 years ago
- VITA: Video Instance Segmentation via Object Token Association (NeurIPS 2022)☆105Jan 4, 2024Updated 2 years ago
- This is the official implementation of "GvSeg: General and Task-Oriented Video Segmentation" (Accepted at ECCV 2024).☆18Jul 15, 2024Updated last year
- [ECCV 2024] Official implementation of C-Instructor: Controllable Navigation Instruction Generation with Chain of Thought Prompting☆29Dec 16, 2024Updated last year
- [CVPR'23] A Generalized Framework for Video Instance Segmentation☆136Jan 4, 2024Updated 2 years ago
- [ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners"☆13Dec 1, 2024Updated last year
- ☆26Oct 8, 2021Updated 4 years ago
- CVPR2022 - Deep Hierarchical Semantic Segmentation - A structured, pixel-wise description of visual scenes in terms of the class hierarch…☆254Apr 24, 2023Updated 2 years ago
- [ICCV23] Bird’s-Eye-View Scene Graph for Vision-Language Navigation☆124Apr 12, 2024Updated last year
- 「AAAI 2024」 Referred by Multi-Modality: A Unified Temporal Transformers for Video Object Segmentation☆83Jun 13, 2025Updated 9 months ago