baaivision / URSALinks
[ICLR 2026] π» Uniform Discrete Diffusion with Metric Path for Video Generation
β102Updated this week
Alternatives and similar repositories for URSA
Users that are interested in URSA are comparing it to the libraries listed below
Sorting:
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-projectβ184Updated 10 months ago
- Official PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiTβ163Updated 3 months ago
- Official Repo for Self-Forcing++ High Quality Long Video Generationβ233Updated 3 months ago
- [NeurIPS 2025] VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Modelsβ162Updated last month
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)β86Updated 11 months ago
- Scaling Text-to-Image Diffusion Transformers with Representation Autoencodersβ202Updated last week
- [NeurIPS 2025] The official repository of "Sekai: A Video Dataset towards World Exploration"β255Updated last month
- Official codes for the paper "GARDO: Reinforcing Diffusion Models without Reward Hacking"β53Updated last week
- β52Updated last year
- UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generationβ46Updated 5 months ago
- [CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesisβ62Updated 9 months ago
- CVPRW 2025 paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151β90Updated 9 months ago
- [CVPR'25 - Rating 555] Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Textβ53Updated 10 months ago
- [ICCV 2025] The official implementation of "Neighboring Autoregressive Modeling for Efficient Visual Generation"β58Updated 10 months ago
- [ICLR 2026] Lumos Project: Frontier video unified model research by Alibaba DAMO Academy.β152Updated 2 weeks ago
- Official PyTorch implementation - Video Motion Transfer with Diffusion Transformersβ77Updated 6 months ago
- [ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"β198Updated last month
- Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"β172Updated last month
- [AAAI 2026] Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devicesβ93Updated 2 months ago
- Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder".β132Updated last month
- Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPOβ92Updated 2 months ago
- Official implementation for What matters for Representation Alignment: Global Information or Spatial Structure?β216Updated last month
- [NeurIPS 2025] Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representationsβ200Updated 4 months ago
- β141Updated 3 months ago
- Official codebase for "Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Geneβ¦β298Updated this week
- β88Updated 2 months ago
- [NeurIPS'25 Spotlight] Boosting Generative Image Modeling via Joint Image-Feature Synthesisβ111Updated 3 months ago
- β35Updated last month
- Official respository for ReasonGen-R1β74Updated 7 months ago
- Official Implementation of "MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives"β185Updated last month