Visual-AI / JoVA
JoVA: Unified Multimodal Learning for Joint Video-Audio Generation
☆28 Updated 3 weeks ago
Alternatives and similar repositories for JoVA
Users who are interested in JoVA are comparing it to the repositories listed below.
- [Arxiv 2025] SparseD: Sparse Attention for Diffusion Language Models ☆54 Updated 3 months ago
- The official repo of continuous speculative decoding ☆31 Updated 9 months ago
- ☆39 Updated 7 months ago
- ☆63 Updated 6 months ago
- A lightweight, highly efficient training framework for accelerating diffusion tasks. ☆51 Updated last year
- VideoNSA: Native Sparse Attention Scales Video Understanding ☆78 Updated 2 months ago
- ☆132 Updated 6 months ago
- [Arxiv 2025] In-Video Instructions: Visual Signals as Generative Control ☆46 Updated last month
- Glance: Accelerating Diffusion Models with 1 Sample ☆147 Updated 3 weeks ago
- [ICML 2025] This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality… ☆53 Updated 9 months ago
- [CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis ☆62 Updated 8 months ago
- ☆140 Updated 3 months ago
- 🐻 Uniform Discrete Diffusion with Metric Path for Video Generation ☆89 Updated 3 weeks ago
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models ☆42 Updated 10 months ago
- The official repo for "D-AR: Diffusion via Autoregressive Models" ☆129 Updated 6 months ago
- [NeurIPS 2024] Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner" ☆71 Updated 2 months ago
- [ICLR 2025] Implementation of Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding ☆48 Updated 8 months ago
- An official implementation of EvoSearch: Scaling Image and Video Generation via Test-Time Evolutionary Search ☆99 Updated 3 months ago
- Official implementation of DiCache: Let Diffusion Model Determine Its Own Cache ☆54 Updated 3 months ago
- Implementation of SmoothCache, a project aimed at speeding up Diffusion Transformer (DiT) based GenAI models with error-guided caching. ☆46 Updated 6 months ago
- Test-time Scaling for VAR models ☆29 Updated 4 months ago
- ☆35 Updated last month
- [NeurIPS 2024] The official implementation of the research paper "FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Atten… ☆63 Updated 6 months ago
- [NeurIPS 2025] HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation ☆73 Updated 4 months ago
- The official implementation of "Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers" (arXiv … ☆50 Updated 7 months ago
- Vico: Compositional Video Generation as Flow Equalization ☆58 Updated last year
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling ☆40 Updated 11 months ago
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening ☆69 Updated 8 months ago
- This is the official repository of InfiniteVL ☆71 Updated last month
- Official implementation of the paper "VMoBA: Mixture-of-Block Attention for Video Diffusion Models" ☆58 Updated 6 months ago