iva-mzsun / MOSO
☆34Updated last year
Alternatives and similar repositories for MOSO
Users that are interested in MOSO are comparing it to the libraries listed below
Sorting:
- [CVPR 2024] This is the official implementation of "ExtDM: Distribution Extrapolation Diffusion Model for Video Prediction"☆48Updated 6 months ago
- This is the official repo of MMVP: motion-matrix-based video prediction (ICCV 2023)☆38Updated last year
- [T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection☆35Updated last year
- Official implementation of "Can Language Understand Depth?"☆81Updated 2 years ago
- A PyTorch implementation of TVC☆24Updated last year
- Improving Mamaba performance on Video Understanding task☆39Updated 6 months ago
- [WACV2025] Official PyTorch implementation of TrackDiffusion (https://arxiv.org/abs/2312.00651)☆79Updated 10 months ago
- ☆37Updated 11 months ago
- ☆17Updated 2 years ago
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning☆41Updated last year
- [ICCV'2023 Oral] Implicit Temporal Modeling with Learnable Alignment for Video Recognition☆35Updated last year
- ☆61Updated last year
- ☆47Updated 2 years ago
- [ECCV 2024] ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video☆20Updated 9 months ago
- ☆35Updated 2 weeks ago
- [CVPRW'23] "A unified model for continuous conditional video prediction". Xi Ye, Guillaume-Alexandre Bilodeau.☆14Updated last year
- ☆42Updated 7 months ago
- 「AAAI 2024」 Referred by Multi-Modality: A Unified Temporal Transformers for Video Object Segmentation☆79Updated 10 months ago
- (ICLR 2024, CVPR 2024) SparseFormer☆74Updated 6 months ago
- [CVPR 2024] Depth-aware Test-Time Training for Zero-shot Video Object Segmentation☆26Updated 2 weeks ago
- [CVPR 2024] The official implementation of paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training"☆35Updated last year
- ECCV 2024 paper template☆50Updated last year
- [CVPR 2024] Adapting Short-Term Transformers for Action Detection in Untrimmed Videos☆12Updated 11 months ago
- [ECCV 2024 Oral] SPLAM: Accelerating Image Generation with Sub-path Linear Approximation Model☆20Updated 6 months ago
- [ICCV 2023] MGMAE: Motion Guided Masking for Video Masked Autoencoding☆21Updated last year
- Official GitHub repository for the Text-Guided Video Editing (TGVE) competition of LOVEU Workshop @ CVPR'23.☆75Updated last year
- [ECCV2022] Global Spectral Filter Memory Network for Video Object Segmentation☆40Updated 2 years ago
- SportsSloMo: A New Benchmark and Baseline Models for Human-centric Video Frame Interpolation, CVPR 2024 (https://arxiv.org/abs/2308.16876…☆74Updated last year
- FQGAN: Factorized Visual Tokenization and Generation☆50Updated last month
- ☆32Updated last year