microsoft / Reducio-VAE
☆182Updated last month
Alternatives and similar repositories for Reducio-VAE:
Users that are interested in Reducio-VAE are comparing it to the libraries listed below
- ☆221Updated 6 months ago
- This is a PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Framework for Cross-Modality Evolu…☆124Updated 2 weeks ago
- NOVA: Autoregressive Video Generation without Vector Quantization☆314Updated this week
- The official implementation of PAR: Parallelized Autoregressive Visual Generation. https://epiphqny.github.io/PAR-project/☆106Updated 2 weeks ago
- [AAAI 2025] Official pytorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion …☆153Updated 9 months ago
- ☆253Updated 2 weeks ago
- ☆107Updated 10 months ago
- Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Vi…☆117Updated 2 months ago
- Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers☆88Updated this week
- Official code for 'Paragraph-to-Image Generation with Information-Enriched Diffusion Model'☆102Updated last month
- [NeurIPS 2024 Spotlight] The official implement of research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"☆123Updated 3 months ago
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆226Updated 4 months ago
- Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".☆184Updated 2 weeks ago
- Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision"☆67Updated last month
- [arXiv'25] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models☆205Updated this week
- Rectified Diffusion: Straightness Is Not Your Need☆161Updated last month
- Author's Implementation for E-LatentLPIPS☆127Updated 2 months ago
- Adaptive Caching for Faster Video Generation with Diffusion Transformers☆134Updated 2 months ago
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models☆257Updated last month
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations☆133Updated 7 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆83Updated 6 months ago
- Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model☆109Updated this week
- GenEval: An object-focused framework for evaluating text-to-image alignment☆143Updated 5 months ago
- Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"☆276Updated 3 weeks ago
- ☆59Updated 5 months ago
- [CVPR 2024] On the Content Bias in Fréchet Video Distance☆101Updated 3 months ago
- Official implementation of Image Conductor: Precision Control for Interactive Video Synthesis☆83Updated 5 months ago
- HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing☆79Updated 8 months ago
- Paint by Inpaint: Learning to Add Image Objects by Removing Them First☆93Updated 4 months ago
- Subjects200K dataset☆90Updated this week