[ICLR 2026] π» Uniform Discrete Diffusion with Metric Path for Video Generation
β119May 20, 2026Updated this week
Alternatives and similar repositories for URSA
Users that are interested in URSA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS'24] Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation (Diffews)β51Apr 14, 2025Updated last year
- Repo of HawkLlama.β16Jan 2, 2025Updated last year
- EvoWorld: Evolving Panoramic World Generation with Explicit 3D Memoryβ68Jan 13, 2026Updated 4 months ago
- [ICLR 2025] Autoregressive Video Generation without Vector Quantizationβ648Oct 29, 2025Updated 6 months ago
- β14Jun 22, 2025Updated 11 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generationβ33Dec 22, 2025Updated 5 months ago
- β36Oct 21, 2022Updated 3 years ago
- [ICLR 2024 Spotlight] The official repo for the paper "De novo Protein Design using Geometric Vector Field Networks".β31Aug 23, 2024Updated last year
- [ICML 2024] Floating Anchor Diffusion Model for Multi-motif Scaffoldingβ34Aug 23, 2024Updated last year
- code for "TVG: A Training-free Transition Video Generation Method with Diffusion Models"β50Aug 19, 2024Updated last year
- β19Aug 1, 2025Updated 9 months ago
- DiverGen (CVPR 2024) & BSGAL (ICML 2024)β53Jul 6, 2025Updated 10 months ago
- β49Oct 6, 2024Updated last year
- [ICML2026] ACTIVE-O3: Empowering Multimodal Large Language Models with Active Perception via GRPOβ80Apr 30, 2026Updated 3 weeks ago
- GPUs on demand by Runpod - Special Offer Available β’ AdRun AI, ML, and HPC workloads on powerful cloud GPUsβwithout limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [ICLR'26] Official PyTorch implementation of "Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models".β64Mar 5, 2026Updated 2 months ago
- UniVid: The Open-Source Unified Video Modelβ32Oct 13, 2025Updated 7 months ago
- [TCSVT 2024] Temporally Consistent Referring Video Object Segmentation with Hybrid Memoryβ19Apr 9, 2025Updated last year
- β28Dec 19, 2025Updated 5 months ago
- β23Jul 5, 2025Updated 10 months ago
- DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perceptionβ159Dec 6, 2024Updated last year
- Math-VR Benchmark & CodePlot-CoT: Mathematical Visual Reasoning by Thinking with Code-Driven Imagesβ61Nov 4, 2025Updated 6 months ago
- Official implementation of paper "VMoBA: Mixture-of-Block Attention for Video Diffusion Models"β65Jul 1, 2025Updated 10 months ago
- Wan: Open and Advanced Large-Scale Video Generative Modelsβ29Jul 28, 2025Updated 9 months ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [AAAI 2026] GenMAC for Compositional Text-to-Video Generationβ32Jan 10, 2026Updated 4 months ago
- β46Oct 29, 2025Updated 6 months ago
- [CVPR 2026] Real2Edit2Real: Generating Robotic Demonstrations via a 3D Control Interfaceβ78Mar 10, 2026Updated 2 months ago
- Flux training codes (lora) for UniTEXβ24Jun 8, 2025Updated 11 months ago
- [IJCV 2025] Paragraph-to-Image Generation with Information-Enriched Diffusion Modelβ107Mar 24, 2025Updated last year
- β20Jan 1, 2026Updated 4 months ago
- [NeurIPS 2024] AlphaTablets: A Generic Plane Representation for 3D Planar Reconstruction from Monocular Videosβ24Dec 6, 2024Updated last year
- Native Multimodal Models are World Learnersβ1,512Dec 30, 2025Updated 4 months ago
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Compositionβ177Sep 1, 2025Updated 8 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ICLR 2026] IVEBench - Benchmark for Instruction-Guided Video Editingβ74Jan 28, 2026Updated 3 months ago
- [ICCV25] TACA: Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformersβ41Jul 23, 2025Updated 10 months ago
- SurfaceSplat: Connecting Surface Reconstruction and Gaussian Splattingβ57Jul 21, 2025Updated 10 months ago
- The code of the paper "DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects"β19May 2, 2025Updated last year
- A Mechanistic View on Video Generation as World Models: State and Dynamicsβ39May 18, 2026Updated last week
- the official repo for "D-AR: Diffusion via Autoregressive Models"β138Jan 29, 2026Updated 3 months ago
- Official implementation of "CAMEO: Correspondence-Attention Alignment for Multi-View Diffusion Models"β50Feb 24, 2026Updated 3 months ago