guanxiongsun / vfe.pytorch
Video Feature Enhancement with PyTorch
☆24Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for vfe.pytorch
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆35Updated last year
- Sambor: Boosting Segment Anything Model Towards Open-Vocabulary Learning☆30Updated 11 months ago
- [ICCV 2023] OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation☆52Updated last year
- [ICCV 2023] PyTorch implementation of RandBox☆52Updated last year
- Fast and general video object segmentation evaluation.☆28Updated 9 months ago
- Official Pytorch codebase for Open-Vocabulary Instance Segmentation without Manual Mask Annotations [CVPR 2023]☆47Updated last year
- [ECCV2024] ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation☆57Updated 2 months ago
- Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023]☆25Updated 8 months ago
- [AAAI 2024] Referred by Multi-Modality: A Unified Temporal Transformers for Video Object Segmentation☆69Updated 4 months ago
- (ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation☆45Updated 4 months ago
- [IEEE TCSVT] Official Pytorch Implementation of CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation.☆35Updated 3 weeks ago
- CVPR2022 - Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation☆23Updated 2 years ago
- ☆29Updated 8 months ago
- [AAAI2024] Code Release of CLIM: Contrastive Language-Image Mosaic for Region Representation☆24Updated 9 months ago
- [CVPR 2024] Exploring Orthogonality in Open World Object Detection☆37Updated 5 months ago
- ☆32Updated 11 months ago
- A Siamese self-supervised pretraining approach for the Transformer architecture in DETR☆34Updated last year
- ☆32Updated 2 years ago
- Tracking with Human-Intent Reasoning☆66Updated 3 weeks ago
- [ECCV 2024] ControlCap: Controllable Region-level Captioning☆55Updated last month
- [T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection☆35Updated last year
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆70Updated 3 months ago
- [TPAMI 2023] Local-Global Context Aware Transformer for Language-Guided Video Segmentation☆48Updated 10 months ago
- ☆16Updated 2 years ago
- [AAAI 2023 Oral] CoMAE: Single Model Hybrid Pre-training on Small-Scale RGB-D Datasets☆34Updated 3 months ago
- ICCV'2023 | CTVIS: Consistent Training for Online Video Instance Segmentation☆70Updated last year
- ☆16Updated last month
- Improving Mamaba performance on Video Understanding task☆32Updated last month
- ☆22Updated 5 months ago
- [CVPR 2023] Official implementation of "SAP-DETR: Bridging the Gap between Salient Points and Queries-Based Transformer Detector for Fast…☆29Updated last year