FuchenUSTC / DTF
☆16 · Updated 2 years ago
Related projects
Alternatives and complementary repositories for DTF
- This is the code for the CVPR 2022 paper "Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation" ☆19 · Updated last year
- [TPAMI 2023] Local-Global Context Aware Transformer for Language-Guided Video Segmentation ☆48 · Updated 10 months ago
- [T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection ☆35 · Updated last year
- LongShortNet for the Streaming Perception task. ☆11 · Updated last year
- ☆32 · Updated 11 months ago
- [ICCV 2023] MGMAE: Motion Guided Masking for Video Masked Autoencoding ☆20 · Updated last year
- ☆47 · Updated 2 years ago
- [NeurIPS 2022] Official implementation of the paper "Rethinking Resolution in the Context of Efficient Video Recognition". ☆32 · Updated 2 years ago
- Contrastive Learning of Image Representations with Cross-Video Cycle-Consistency ☆17 · Updated 2 years ago
- [AAAI 2023] Revisiting the Spatial and Temporal Modeling for Few-shot Action Recognition (SloshNet) ☆12 · Updated 10 months ago
- [TPAMI 2023] Object Affinity Learning: Towards Annotation-free Instance Segmentation ☆14 · Updated last year
- PyTorch implementation of Mix-Shifting-MLP (MS-MLP) ☆16 · Updated 2 years ago
- [ECCV 2022] Official PyTorch implementation of the paper "Proposal-Free Temporal Action Detection with Global Segmentation Mask Learning"… ☆18 · Updated 2 years ago
- Code base for vision transformers ☆36 · Updated 2 years ago
- [CVPR 2023] Enlarge Instance-specific and Class-specific Information for Open-set Action Recognition ☆26 · Updated last year
- [CVPR 2022 Oral] Towards Open Set Temporal Action Localization ☆50 · Updated last year
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training ☆68 · Updated last year
- How Much Position Information Do Convolutional Neural Networks Encode? ☆10 · Updated 3 years ago
- SimMAT: Exploring Transferability from Vision Foundation Models to Any Image Modality ☆22 · Updated 2 months ago
- Official code for ECCV 2022 paper ☆30 · Updated 5 months ago
- [ECCV 2022] AMixer: Adaptive Weight Mixing for Self-attention Free Vision Transformers ☆28 · Updated 2 years ago
- ☆15 · Updated 2 years ago
- Code release for "BoxVIS: Video Instance Segmentation with Box Annotation" ☆11 · Updated 11 months ago
- [ICCV 23] This is a PyTorch implementation of our paper "SMMix: Self-Motivated Image Mixing for Vision Transformers" ☆17 · Updated last year
- Video Test-Time Adaptation for Action Recognition (CVPR 2023) ☆36 · Updated last month
- Official code for BA-SAM: Scalable Bias-Mode Attention Mask for Segment Anything Model ☆13 · Updated 5 months ago
- The official project for the paper: Slot-VPS: Object-centric Representation Learning for Video Panoptic Segmentation, CVPR 2022 ☆14 · Updated 2 years ago
- [NeurIPS 2024] Hawk: Learning to Understand Open-World Video Anomalies ☆20 · Updated last month
- ☆33 · Updated last year
- Code for Point-Level Region Contrast (https://arxiv.org/abs/2202.04639) ☆32 · Updated last year