CASIA-IVA-Lab / MOSOLinks
☆35Updated 2 years ago
Alternatives and similar repositories for MOSO
Users that are interested in MOSO are comparing it to the libraries listed below
Sorting:
- [CVPR 2024] This is the official implementation of "ExtDM: Distribution Extrapolation Diffusion Model for Video Prediction"☆55Updated 7 months ago
- ☆38Updated last year
- This is the official repo of MMVP: motion-matrix-based video prediction (ICCV 2023)☆42Updated 2 years ago
- [T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection☆37Updated 2 years ago
- [CVPRW'23] "A unified model for continuous conditional video prediction". Xi Ye, Guillaume-Alexandre Bilodeau.☆14Updated last year
- Official implementation of "Can Language Understand Depth?"☆84Updated 3 years ago
- ☆16Updated 3 years ago
- Improving Mamaba performance on Video Understanding task☆44Updated last month
- A PyTorch implementation of TVC☆24Updated 2 years ago
- [CVPR 2024] Depth-aware Test-Time Training for Zero-shot Video Object Segmentation☆29Updated 9 months ago
- ☆19Updated last year
- ☆40Updated last year
- Test-Time Training on Video Streams☆66Updated 2 years ago
- ☆33Updated last year
- (ICLR 2024, CVPR 2024) SparseFormer☆75Updated last year
- SportsSloMo: A New Benchmark and Baseline Models for Human-centric Video Frame Interpolation, CVPR 2024 (https://arxiv.org/abs/2308.16876…☆77Updated last year
- [AAAI 2024] "LDMVFI: Video Frame Interpolation with Latent Diffusion Models", Duolikun Danier, Fan Zhang, David Bull☆181Updated 2 years ago
- Code for the paper "Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation", ECCV 2024☆47Updated last year
- SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modality☆35Updated last year
- [ICCV2023 Oral] Implicit Temporal Modeling with Learnable Alignment for Video Recognition☆41Updated 2 years ago
- 「AAAI 2024」 Referred by Multi-Modality: A Unified Temporal Transformers for Video Object Segmentation☆82Updated 7 months ago
- ☆31Updated 2 years ago
- [CVPR 2024] Adapting Short-Term Transformers for Action Detection in Untrimmed Videos☆12Updated last year
- [CVPR 2024] Exploiting Diffusion Prior for Generalizable Dense Prediction☆80Updated last year
- Not All Steps are Created Equal: Selective Diffusion Distillation for Image Manipulation (ICCV 2023)☆66Updated 2 years ago
- Official Implementation for "Matching Is Not Enough: A Two-Stage Framework for Category-Agnostic Pose Estimation", CVPR 2023.☆54Updated 2 years ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆86Updated 9 months ago
- [CVPR'24] Neural Clustering based Visual Representation Learning☆44Updated 3 months ago
- [NeurIPS2024 Spotlight] The official implementation of MambaTree: Tree Topology is All You Need in State Space Model☆105Updated last year
- code for paper: Simultaneous Image to Zero and Zero to Noise: Diffusion Models with Analytical Image Attenuation☆60Updated 2 weeks ago