nku-zhichengzhang / ExtDM
[CVPR 2024] This is the official implementation of "ExtDM: Distribution Extrapolation Diffusion Model for Video Prediction"
☆44Updated 3 months ago
Alternatives and similar repositories for ExtDM:
Users that are interested in ExtDM are comparing it to the libraries listed below
- ☆34Updated last year
- This is the official repo of MMVP: motion-matrix-based video prediction (ICCV 2023)☆37Updated last year
- ☆37Updated 8 months ago
- This is the official implementation for ControlVAR.☆95Updated 2 months ago
- This repository contains the official implementation of "FlowIE: Efficient Image Enhancement via Rectified Flow"☆96Updated last month
- ☆38Updated 4 months ago
- ☆15Updated 10 months ago
- [NeurIPS2024 Spotlight] The official implementation of GrootVL: Tree Topology is All You Need in State Space Model☆89Updated 8 months ago
- Official code for "DiffX: Guide Your Layout to Cross-Modal Generative Modeling"☆19Updated 2 months ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆73Updated 6 months ago
- [CVPR 2024] Depth-aware Test-Time Training for Zero-shot Video Object Segmentation☆25Updated 8 months ago
- Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023]☆27Updated 11 months ago
- CAR: Controllable AutoRegressive Modeling for Visual Generation☆102Updated 2 months ago
- SAM-DiffSR: Structure-Modulated Diffusion Model for Image Super-Resolution☆110Updated 10 months ago
- Official PyTorch implementation of TrackDiffusion (https://arxiv.org/abs/2312.00651)☆76Updated 7 months ago
- [CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"☆67Updated 4 months ago
- [CVPR 2024] Solving Masked Jigsaw Puzzles with Diffusion Vision Transformers☆22Updated 8 months ago
- Official implement of ICML2024 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation☆47Updated 6 months ago
- Code of our CVPR2024 paper - DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data☆47Updated 10 months ago
- [CVPR'24] Neural Clustering based Visual Representation Learning☆40Updated 10 months ago
- The official implementation of DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis☆182Updated 7 months ago
- [T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection☆35Updated last year
- [ECCV 2024] ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video☆19Updated 6 months ago
- FQGAN: Factorized Visual Tokenization and Generation☆42Updated last month
- Offical implemention of the paper DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction☆21Updated 8 months ago
- This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision☆68Updated 8 months ago
- [NeurlPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos☆105Updated last month
- [CVPR 2024] Guided Slot Attention for Unsupervised Video Object Segmentation☆49Updated last month
- [ICLR2025] Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion☆90Updated 3 months ago
- Improving Mamaba performance on Video Understanding task☆35Updated 4 months ago