[ICLR 2026] Uniform Discrete Diffusion with Metric Path for Video Generation
★123 · May 20, 2026 · Updated last month
Alternatives and similar repositories for URSA
Users who are interested in URSA are comparing it to the libraries listed below.
- [ICLR 2025] Autoregressive Video Generation without Vector Quantization · ★652 · Oct 29, 2025 · Updated 8 months ago
- EvoWorld: Evolving Panoramic World Generation with Explicit 3D Memory · ★71 · Jan 13, 2026 · Updated 5 months ago
- [NeurIPS'24] A Simple Image Segmentation Framework via In-Context Examples · ★67 · Oct 29, 2024 · Updated last year
- ★39 · Mar 5, 2026 · Updated 3 months ago
- ★36 · Oct 21, 2022 · Updated 3 years ago
- [ICLR 2024 Spotlight] The official repo for the paper "De novo Protein Design using Geometric Vector Field Networks". ★31 · Aug 23, 2024 · Updated last year
- [ICML 2024] Floating Anchor Diffusion Model for Multi-motif Scaffolding · ★34 · Aug 23, 2024 · Updated last year
- code for "TVG: A Training-free Transition Video Generation Method with Diffusion Models" · ★50 · Aug 19, 2024 · Updated last year
- ★19 · Aug 1, 2025 · Updated 11 months ago
- DiverGen (CVPR 2024) & BSGAL (ICML 2024) · ★53 · Jul 6, 2025 · Updated 11 months ago
- ★49 · Oct 6, 2024 · Updated last year
- [ICML2026] ACTIVE-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO · ★81 · Apr 30, 2026 · Updated 2 months ago
- [ICLR'26] Official PyTorch implementation of "Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models". ★66 · Mar 5, 2026 · Updated 3 months ago
- UniVid: The Open-Source Unified Video Model · ★33 · Oct 13, 2025 · Updated 8 months ago
- [TCSVT 2024] Temporally Consistent Referring Video Object Segmentation with Hybrid Memory · ★19 · Apr 9, 2025 · Updated last year
- ★28 · Dec 19, 2025 · Updated 6 months ago
- ★23 · Jul 5, 2025 · Updated 11 months ago
- DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception · ★159 · Dec 6, 2024 · Updated last year
- Official implementation of paper "VMoBA: Mixture-of-Block Attention for Video Diffusion Models" · ★65 · Jul 1, 2025 · Updated last year
- Wan: Open and Advanced Large-Scale Video Generative Models · ★31 · Jul 28, 2025 · Updated 11 months ago
- ★48 · Oct 29, 2025 · Updated 8 months ago
- [CVPR 2026] Real2Edit2Real: Generating Robotic Demonstrations via a 3D Control Interface · ★82 · Mar 10, 2026 · Updated 3 months ago
- Flux training codes (lora) for UniTEX · ★25 · Jun 8, 2025 · Updated last year
- [IJCV 2025] Paragraph-to-Image Generation with Information-Enriched Diffusion Model · ★107 · Mar 24, 2025 · Updated last year
- [AAAI 2026] GenMAC for Compositional Text-to-Video Generation · ★35 · Jan 10, 2026 · Updated 5 months ago
- [NeurIPS 2024] AlphaTablets: A Generic Plane Representation for 3D Planar Reconstruction from Monocular Videos · ★24 · Dec 6, 2024 · Updated last year
- [ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation · ★426 · Apr 25, 2025 · Updated last year
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition · ★178 · Sep 1, 2025 · Updated 10 months ago
- Native Multimodal Models are World Learners · ★1,528 · Dec 30, 2025 · Updated 6 months ago
- [ICCV25] TACA: Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers · ★42 · Jul 23, 2025 · Updated 11 months ago
- SurfaceSplat: Connecting Surface Reconstruction and Gaussian Splatting · ★57 · Jul 21, 2025 · Updated 11 months ago
- The code of the paper "DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects" · ★19 · May 2, 2025 · Updated last year
- the official repo for "D-AR: Diffusion via Autoregressive Models" · ★139 · Jan 29, 2026 · Updated 5 months ago
- Official implementation of "CAMEO: Correspondence-Attention Alignment for Multi-View Diffusion Models" · ★55 · May 26, 2026 · Updated last month
- [ICLR 2026] Codebase for paper "Geometry-aware 4D Video Generation for Robot Manipulation" · ★123 · Jan 10, 2026 · Updated 5 months ago
- [ICCV 2025] GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding · ★77 · Jun 26, 2025 · Updated last year
- This repository contains the code for the paper "Neuro-Symbolic Query Compiler", accepted to the Findings of ACL 2025. ★17 · Oct 20, 2025 · Updated 8 months ago
- [NeurIPS 2025 Spotlight] A Generalist Diffusion Model for Vision Perception · ★316 · Sep 21, 2025 · Updated 9 months ago
- [ECCV 2026] Towards Scalable Pre-training of Visual Tokenizers for Generation · ★495 · Apr 15, 2026 · Updated 2 months ago