☆81Oct 18, 2025Updated 4 months ago
Alternatives and similar repositories for DC-AR
Users that are interested in DC-AR are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025 Oral]Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation☆727Nov 27, 2025Updated 3 months ago
- PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation☆37Oct 28, 2024Updated last year
- official implementation of "CLIP-VQDiffusion : Langauge Free Training of Text To Image generation using CLIP and vector quantized diffusi…☆18Sep 5, 2024Updated last year
- StyleAR: Customizing Multimodal Autoregressive Model for Style-Aligned Text-to-Image Generation☆43Jun 6, 2025Updated 9 months ago
- Adapting Self-Supervised Representations as a Latent Space for Efficient Generation☆40Oct 17, 2025Updated 4 months ago
- Official Implementation of "Video Camera Trajectory Editing with Generative Rendering from Estimated Geometry"☆31Nov 10, 2025Updated 3 months ago
- 🌟Official code of our AAAI26 paper 🔍WebFilter☆37Nov 9, 2025Updated 3 months ago
- "Enhancing Neural Audio Fingerprint Robustness to Audio Degradation for Music Identification" ISMIR2025☆30Sep 11, 2025Updated 5 months ago
- TPDiff: Temporal Pyramid Video Diffusion Model☆25Mar 13, 2025Updated 11 months ago
- EraseAnything, ICML 2025☆39Sep 28, 2025Updated 5 months ago
- [ICLR 2026 Oral] Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation☆91Feb 7, 2026Updated 3 weeks ago
- Pixel-Space Generative Models☆303May 11, 2025Updated 9 months ago
- Implementation of the paper "MaskBit: Embedding-free Image Generation from Bit Tokens"☆88Apr 10, 2025Updated 10 months ago
- Official PyTorch implementation of FlowMo.☆114Apr 7, 2025Updated 10 months ago
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"☆166Jan 31, 2025Updated last year
- [ICML'25] EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling.☆174Jun 26, 2025Updated 8 months ago
- RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space☆39Oct 16, 2025Updated 4 months ago
- [ECCV-24] This is the official implementation of the paper "SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation".☆27Oct 13, 2024Updated last year
- [Preprint] UCGM: Unified Continuous Generative Models☆182May 27, 2025Updated 9 months ago
- [ICIP 2025] Scribble-Guided Diffusion for Training-free Text-to-Image Generation☆24Oct 2, 2024Updated last year
- A WebUI for Side-by-Side Comparison of Media (Images/Videos) Across Multiple Folders☆25Feb 21, 2025Updated last year
- [AAAI 2026] Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devices☆95Nov 30, 2025Updated 3 months ago
- Extreme Image Compression using Fine-tuned VQGAN Models (DCC 2024)☆23Jan 14, 2025Updated last year
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project☆184Mar 20, 2025Updated 11 months ago
- Official implementation of FouriScale (ECCV2024)☆159Jul 27, 2024Updated last year
- ☆26Jun 20, 2024Updated last year
- Selftok: Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning☆237May 30, 2025Updated 9 months ago
- PyTorch Implementation of “SMC++: Masked Learning of Unsupervised Video Semantic Compression", an extended version of ICCV 2023 paper "No…☆36Jan 11, 2026Updated last month
- ☆39May 20, 2025Updated 9 months ago
- Official implementary of HCoG: Apply Hierarchical-Chain-of-Generation to Complex Attributes Text-to-3D Generation [CVPR 2025]☆58Jul 28, 2025Updated 7 months ago
- Pose Extraction & Rendering for SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representat…☆184Dec 28, 2025Updated 2 months ago
- ☆183Jun 27, 2025Updated 8 months ago
- TerDiT: Ternary Diffusion Models with Transformers☆74Jun 17, 2024Updated last year
- Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scales☆32Jul 17, 2023Updated 2 years ago
- SEED-Voken: A Series of Powerful Visual Tokenizers☆997Nov 25, 2025Updated 3 months ago
- ☆63Jul 11, 2025Updated 7 months ago
- ☆32May 3, 2024Updated last year
- The official code implementation of "Towards Interactive Image Inpainting via Sketch Refinement".☆47Dec 11, 2025Updated 2 months ago
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)☆86Feb 27, 2025Updated last year