nku-zhichengzhang / ExtDM
[CVPR 2024] This is the official implementation of "ExtDM: Distribution Extrapolation Diffusion Model for Video Prediction"
☆45Updated 4 months ago
Alternatives and similar repositories for ExtDM:
Users that are interested in ExtDM are comparing it to the libraries listed below
- This is the official repo of MMVP: motion-matrix-based video prediction (ICCV 2023)☆37Updated last year
- ☆34Updated last year
- [NeurIPS2024 Spotlight] The official implementation of GrootVL: Tree Topology is All You Need in State Space Model☆90Updated 9 months ago
- ☆40Updated 5 months ago
- ☆15Updated 11 months ago
- ☆37Updated 9 months ago
- [T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection☆35Updated last year
- Official implementation of CVPR 2024 paper "Multi-criteria Token Fusion with One-step-ahead Attention for Efficient Vision Transformers".☆34Updated 10 months ago
- Improving Mamaba performance on Video Understanding task☆38Updated 4 months ago
- [ECCV 2024] ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video☆20Updated 7 months ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆74Updated 6 months ago
- This is the official implementation for ControlVAR.☆96Updated 3 months ago
- [CVPR 2024] Exploiting Diffusion Prior for Generalizable Dense Prediction☆70Updated 10 months ago
- [CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"☆68Updated 5 months ago
- [NeurlPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos☆107Updated 2 months ago
- CAR: Controllable AutoRegressive Modeling for Visual Generation☆106Updated 3 months ago
- SAM-DiffSR: Structure-Modulated Diffusion Model for Image Super-Resolution☆112Updated 11 months ago
- Code of our CVPR2024 paper - DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data☆51Updated 11 months ago
- [CVPR 2024] Guided Slot Attention for Unsupervised Video Object Segmentation☆52Updated 2 months ago
- Official implement of ICML2024 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation☆48Updated 6 months ago
- Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023]☆27Updated last year
- [CVPR 2024] Depth-aware Test-Time Training for Zero-shot Video Object Segmentation☆26Updated 9 months ago
- [ECCV2024]The official implementation of the DiffPNG paper in PyTorch.☆11Updated 4 months ago
- [ICCV'2023 Oral] Implicit Temporal Modeling with Learnable Alignment for Video Recognition☆34Updated last year
- The official implementation of DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis☆185Updated 8 months ago
- ☆15Updated last year
- Official PyTorch implementation of GeoDiffusion in ICLR 2024 (https://arxiv.org/abs/2306.04607)☆77Updated 2 months ago
- This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision☆69Updated 8 months ago
- [ICLR2025] Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion☆106Updated last week