nku-zhichengzhang / ExtDM
[CVPR 2024] This is the official implementation of "ExtDM: Distribution Extrapolation Diffusion Model for Video Prediction"
☆54 Updated 5 months ago
Alternatives and similar repositories for ExtDM
Users who are interested in ExtDM compare it to the repositories listed below
- [NeurIPS2024 Spotlight] The official implementation of MambaTree: Tree Topology is All You Need in State Space Model ☆102 Updated last year
- ☆35 Updated 2 years ago
- This is the official repo of MMVP: motion-matrix-based video prediction (ICCV 2023) ☆42 Updated 2 years ago
- Code of our CVPR2024 paper - DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data ☆59 Updated last year
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition ☆87 Updated 8 months ago
- The official implementation of DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis ☆225 Updated last year
- [ECCV 2024] DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control ☆85 Updated last year
- CAR: Controllable AutoRegressive Modeling for Visual Generation ☆126 Updated last year
- Official implementation for "Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter" ☆49 Updated 2 months ago
- A PyTorch implementation of the paper "ZigMa: A DiT-Style Mamba-based Diffusion Model" (ECCV 2024) ☆339 Updated 8 months ago
- A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting [ECCV 2024] ☆101 Updated last year
- Code for the paper "Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation", ECCV 2024 ☆45 Updated last year
- ☆38 Updated last year
- This is the official implementation for ControlVAR. ☆125 Updated last year
- Official implementation of "STAR: Scale-wise Text-to-image generation via Auto-Regressive representations" ☆40 Updated 8 months ago
- [ICLR2025] This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision ☆87 Updated 6 months ago
- Text-Image Alignment for Diffusion-based Perception (TADP) - CVPR 2024 ☆40 Updated last year
- ☆12 Updated last year
- [ECCV 2024] Official implementation of the paper "Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning… ☆29 Updated 9 months ago
- SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modality ☆35 Updated last year
- Frequency Autoregressive Image Generation with Continuous Tokens ☆93 Updated 6 months ago
- Official code for "DiffX: Guide Your Layout to Cross-Modal Generative Modeling" ☆22 Updated 9 months ago
- [T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection ☆37 Updated 2 years ago
- [NeurIPS 2025 Oral] Representation Entanglement for Generation: Training Diffusion Transformers Is Much Easier Than You Think ☆198 Updated 2 months ago
- [NeurIPS'24] Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation (Diffews) ☆47 Updated 7 months ago
- The official implementation of "[MASK] is All You Need" ☆125 Updated 4 months ago
- [CVPR 2025] HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation ☆58 Updated 5 months ago
- [NeurIPS 2024] SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow ☆36 Updated last year
- Transactions on Multimedia (TMM25) ☆18 Updated 8 months ago
- (NeurIPS 2025) Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Image Generation ☆55 Updated last month