iva-mzsun / MOSO
☆33Updated last year
Related projects ⓘ
Alternatives and complementary repositories for MOSO
- [CVPR 2024] This is the official implementation of "ExtDM: Distribution Extrapolation Diffusion Model for Video Prediction"☆33Updated 3 weeks ago
- This is the official repo of MMVP: motion-matrix-based video prediction (ICCV 2023)☆36Updated last year
- [T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection☆35Updated last year
- ☆47Updated 2 years ago
- [CVPRW'23] "A unified model for continuous conditional video prediction". Xi Ye, Guillaume-Alexandre Bilodeau.☆13Updated 7 months ago
- Official implementation of "Can Language Understand Depth?"☆76Updated 2 years ago
- ☆36Updated 5 months ago
- ☆31Updated 8 months ago
- [AAAI 2024] Referred by Multi-Modality: A Unified Temporal Transformers for Video Object Segmentation☆69Updated 4 months ago
- The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition"☆61Updated 7 months ago
- ☆33Updated last month
- [ICCV 2023] Official PyTorch implementation of the paper "DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion"☆32Updated last year
- [CVPR'24] Neural Clustering based Visual Representation Learning☆37Updated 7 months ago
- Implementation of the paper Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval (CVPR 2024)☆12Updated 2 weeks ago
- Official implementation of "ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video" (ECCV2024)☆18Updated 3 months ago
- [NeurlPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos☆33Updated 2 weeks ago
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning☆39Updated last year
- ☆32Updated 11 months ago
- [ECCV2022] Global Spectral Filter Memory Network for Video Object Segmentation☆38Updated 2 years ago
- [CVPR 2024] Depth-aware Test-Time Training for Zero-shot Video Object Segmentation☆25Updated 5 months ago
- Code for the paper "Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundatio…☆23Updated last year
- Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023]☆25Updated 8 months ago
- Official PyTorch implementation of TrackDiffusion (https://arxiv.org/abs/2312.00651)☆65Updated 4 months ago
- ☆31Updated 6 months ago
- [NeurIPS 2022] Official implementation of the paper "Rethinking Resolution in the Context of Efficient Video Recognition".☆32Updated 2 years ago
- ☆58Updated last year
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆70Updated 3 months ago
- Not All Steps are Created Equal: Selective Diffusion Distillation for Image Manipulation (ICCV 2023)☆63Updated last year
- [CVPR 2024] Exploiting Diffusion Prior for Generalizable Dense Prediction☆65Updated 7 months ago