FuchenUSTC / DTF
☆16Updated 2 years ago
Alternatives and similar repositories for DTF:
Users that are interested in DTF are comparing it to the libraries listed below
- This is the code for CVPR2022 paper "Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation"☆19Updated last year
- Unofficial implementation of "SSAN: Separable Self-Attention Network for Video Representation Learning (CVPR2021)", in Pytorch☆8Updated 3 years ago
- ☆47Updated 2 years ago
- Contrastive Learning of Image Representations with Cross-Video Cycle-Consistency☆17Updated 3 years ago
- [AAAI2023] Revisiting the Spatial and Temporal Modeling for Few-shot Action Recognition (SloshNet)☆12Updated last year
- Official Implementation of "Semantics-Consistent Feature Search for Self-Supervised Visual Representation Learning" in AAAI2024.☆13Updated 10 months ago
- Official code for ECCV 2022 paper☆31Updated 7 months ago
- [ICCV'2023 Oral] Implicit Temporal Modeling with Learnable Alignment for Video Recognition☆32Updated last year
- [NeurIPS 2022] Official implementation of the paper "Rethinking Resolution in the Context of Efficient Video Recognition".☆32Updated 2 years ago
- [ECCV 2022] Official Pytorch Implementation of paper : " Proposal-Free Temporal Action Detection with Global Segmentation Mask Learning "…☆19Updated 2 years ago
- LongShortNet for Streaming Perception task.☆13Updated last year
- Code for "Learning Temporally and Semantically Consistent Unpaired Video-to-video Translation Through Pseudo-Supervision From Synthetic O…☆20Updated 5 months ago
- Adaptive Split-Fusion Transformer (ICME 2023 Oral)☆15Updated 10 months ago
- [ICCV2023] Spatio-temporal Prompting Network for Robust Video Feature Extraction☆10Updated last year
- Code repo for "FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance Segmentation" (ICCV 2021)☆29Updated 3 years ago
- Pytorch implementation of Mix-Shifting-MLP (MS-MLP)☆16Updated 2 years ago
- [T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection☆35Updated last year
- [CVPR 2022 Oral] Towards Open Set Temporal Action Localization☆50Updated last year
- Lightweight Transformer for Multi-modal Tasks☆15Updated 2 years ago
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆68Updated last year
- The official project for the paper: Slot-VPS: Object-centric Representation Learning for Video Panoptic Segmentation, CVPR 2022☆14Updated 2 years ago
- [TPAMI 2022 & CVPR 2020 Oral] Dynamic Graph Message Passing Networks☆30Updated 2 years ago
- Repository for the CVPR23 paper Re^2TAL☆12Updated 9 months ago
- [TPAMI 2023] Local-Global Context Aware Transformer for Language-Guided Video Segmentation☆48Updated 11 months ago
- [CVPR 2024] Depth-aware Test-Time Training for Zero-shot Video Object Segmentation☆25Updated 7 months ago
- SWEM: Towards Real-Time Video Object Segmentation with Sequential Weighted Expectation-Maximization☆27Updated 2 years ago
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction☆46Updated last year
- ☆44Updated last year
- [ICCV 23]This is a Pytorch implementation of our paper "SMMix: Self-Motivated Image Mixing for Vision Transformers"☆16Updated last year
- [ Arxiv 2023 ] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection"☆14Updated last year