LiChenyang-Github / LongShortNet
LongShortNet for Streaming Perception task.
☆11Updated last year
Related projects ⓘ
Alternatives and complementary repositories for LongShortNet
- This is the code for CVPR2022 paper "Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation"☆19Updated last year
- Video Feature Enhancement with PyTorch☆24Updated 9 months ago
- Unifying Visual Perception by Dispersible Points Learning (ECCV 2022)☆51Updated 2 years ago
- ☆16Updated 2 years ago
- ☆22Updated 5 months ago
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆35Updated last year
- Code For Our Work: DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries [ECCV-2024]☆13Updated 4 months ago
- Lightweight Transformer for Multi-modal Tasks☆15Updated last year
- ☆32Updated 2 years ago
- [AAAI-2024] Structural Information Guided Multimodal Pre-training for Vehicle-centric Perception, Xiao Wang, Wentao Wu, Chenglong Li, Zhi…☆19Updated 3 months ago
- Teach-DETR: Better Training DETR with Teachers☆29Updated 8 months ago
- [TPAMI 2023] Local-Global Context Aware Transformer for Language-Guided Video Segmentation☆48Updated 10 months ago
- [AAAI 2024] AVSegFormer: Audio-Visual Segmentation with Transformer☆56Updated 7 months ago
- [NeurIPS 2024] Hawk: Learning to Understand Open-World Video Anomalies☆20Updated last month
- ☆16Updated 2 years ago
- ☆32Updated 11 months ago
- ☆23Updated 3 weeks ago
- CVPR2022 - Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation☆23Updated 2 years ago
- [ Arxiv 2023 ] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection"☆14Updated last year
- [ICCV 2023] OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation☆52Updated last year
- Official Pytorch codebase for Open-Vocabulary Instance Segmentation without Manual Mask Annotations [CVPR 2023]☆47Updated last year
- [ECCV 2024] Beyond MOT: Semantic Multi-Object Tracking☆27Updated 2 months ago
- [NeurIPS 2022] PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points☆40Updated last year
- Code for Point-Level Regin Contrast (https//arxiv.org/abs/2202.04639)☆32Updated last year
- [NeurIPS 2022] Official implementation of the paper "Rethinking Resolution in the Context of Efficient Video Recognition".☆32Updated 2 years ago
- Sambor: Boosting Segment Anything Model Towards Open-Vocabulary Learning☆30Updated 11 months ago
- ☆10Updated 10 months ago
- code base for vision transformers☆36Updated 2 years ago
- [T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection☆35Updated last year
- SimMAT: Exploring Transferability from Vision Foundation Models to Any Image Modality☆22Updated 2 months ago