[ICLR 2026] π» Uniform Discrete Diffusion with Metric Path for Video Generation
β106Feb 6, 2026Updated 3 weeks ago
Alternatives and similar repositories for URSA
Users that are interested in URSA are comparing it to the libraries listed below
Sorting:
- [NeurIPS'24] Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation (Diffews)β48Apr 14, 2025Updated 10 months ago
- [ICLR 2025] Autoregressive Video Generation without Vector Quantizationβ629Oct 29, 2025Updated 4 months ago
- Repo of HawkLlama.β16Jan 2, 2025Updated last year
- EvoWorld: Evolving Panoramic World Generation with Explicit 3D Memoryβ62Jan 13, 2026Updated last month
- [ICCV25] TACA: Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformersβ41Jul 23, 2025Updated 7 months ago
- code for "TVG: A Training-free Transition Video Generation Method with Diffusion Models"β48Aug 19, 2024Updated last year
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generationβ30Dec 22, 2025Updated 2 months ago
- A quick test using a Stable Diffusion server and Godot 4β11Mar 17, 2023Updated 2 years ago
- β13Jun 22, 2025Updated 8 months ago
- β39Oct 29, 2025Updated 4 months ago
- β23Dec 19, 2025Updated 2 months ago
- Official implementation for our paper: Rethinking Video Tokenization: A Conditioned Diffusion-based Approachβ14Apr 2, 2025Updated 11 months ago
- ACTIVE-O3: Empowering Multimodal Large Language Models with Active Perception via GRPOβ79Nov 17, 2025Updated 3 months ago
- Concat-ID: Towards Universal Identity-Preserving Video Synthesisβ66May 7, 2025Updated 9 months ago
- Official pytorch implementation of "AlphaFlow: Understanding and Improving MeanFlow Models"β99Oct 24, 2025Updated 4 months ago
- [NeurIPS'24] A Simple Image Segmentation Framework via In-Context Examplesβ65Oct 29, 2024Updated last year
- [ICML 2024] Floating Anchor Diffusion Model for Multi-motif Scaffoldingβ31Aug 23, 2024Updated last year
- This repository contains the code for the paper βNeuro-Symbolic Query Compilerβ, accepted to the Findings of ACL 2025.β16Oct 20, 2025Updated 4 months ago
- Flux training codes (lora) for UniTEXβ23Jun 8, 2025Updated 8 months ago
- OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editingβ38Jan 9, 2026Updated last month
- On Path to Multimodal Generalist: General-Level and General-Benchβ18Jul 11, 2025Updated 7 months ago
- [ICLR 2024 Spotlight] The official repo for the paper "De novo Protein Design using Geometric Vector Field Networks".β30Aug 23, 2024Updated last year
- β39Dec 8, 2023Updated 2 years ago
- Official implementation of paper "VMoBA: Mixture-of-Block Attention for Video Diffusion Models"β62Jul 1, 2025Updated 8 months ago
- β20Jun 16, 2025Updated 8 months ago
- OneTo3D: One Image to Editable Dynamic 3D Model and Video Generationβ15May 15, 2024Updated last year
- Mixture-of-Groups Attention for End-to-End Long Video Generationβ92Oct 22, 2025Updated 4 months ago
- β37Oct 21, 2022Updated 3 years ago
- UniVid: The Open-Source Unified Video Modelβ30Oct 13, 2025Updated 4 months ago
- Wan: Open and Advanced Large-Scale Video Generative Modelsβ28Jul 28, 2025Updated 7 months ago
- Build a skeleton using Blender and register it to human mesh.β16May 29, 2022Updated 3 years ago
- lite attention implemented over flash attention 3β45Updated this week
- the official repo for "D-AR: Diffusion via Autoregressive Models"β135Jan 29, 2026Updated last month
- KMM: Key Frame Mask Mamba for Extended Motion Generationβ19Sep 22, 2025Updated 5 months ago
- codes for Makeup Extraction of 3D Representation via Illumination-Aware Image Decomposition (Eurographics2023)β18Mar 2, 2025Updated last year
- Cost-Sensitive Toolpath Agent for Multi-turn Image Editingβ26Mar 26, 2025Updated 11 months ago
- [TCSVT 2024] Temporally Consistent Referring Video Object Segmentation with Hybrid Memoryβ19Apr 9, 2025Updated 10 months ago
- DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perceptionβ159Dec 6, 2024Updated last year
- [arXiv'25] AnyCharV: Bootstrap Controllable Character Video Generation with Fine-to-Coarse Guidanceβ41Feb 19, 2025Updated last year