baaivision/URSA

[ICLR 2026] 🐻 Uniform Discrete Diffusion with Metric Path for Video Generation

☆123

Alternatives and similar repositories for URSA

Users who are interested in URSA compare it to the repositories listed below.

aim-uofa / GSI-Bench
[CVPR2026] Exploring Spatial Intelligence from a Generative Perspective
☆30 · Jun 3, 2026 · Updated last month
Yovecent / UDM-GRPO
[ICML 2026 Spotlight] UDM-GRPO: Stable and Efficient Group Relative Policy Optimization for Uniform Discrete Diffusion Models
☆27 · May 1, 2026 · Updated 2 months ago
aim-uofa / EvoTokenDLM
[ACL'26] EvoToken-DLM (Beyond Hard Masks: Progressive Token Evolution for Diffusion Language)
☆48 · Apr 7, 2026 · Updated 3 months ago
baaivision / NOVA
[ICLR 2025] Autoregressive Video Generation without Vector Quantization
☆656 · Oct 29, 2025 · Updated 8 months ago
aim-uofa / OmniJigsaw
☆35 · Apr 10, 2026 · Updated 3 months ago
aim-uofa / STAIR
☆18 · Jun 13, 2026 · Updated last month
baaivision / CoS
[NeurIPS 2025] Unveiling Chain of Step Reasoning for Vision-Language Models with Fine-grained Rewards
☆18 · Oct 6, 2025 · Updated 9 months ago
baaivision / Emu3.5
Native Multimodal Models are World Learners
☆1,537 · Dec 30, 2025 · Updated 6 months ago
aim-uofa / StaMo
Unsupervised Learning of Generalizable Robot Motion from Compact State Representation
☆40 · Jun 10, 2026 · Updated last month
aim-uofa / dLLM-MidTruth
[ICLR'26] Official PyTorch implementation of "Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models".
☆66 · Mar 5, 2026 · Updated 4 months ago
KlingAIResearch / VMoBA
Official implementation of paper "VMoBA: Mixture-of-Block Attention for Video Diffusion Models"
☆64 · Jul 1, 2025 · Updated last year
aim-uofa / VLModel
Repo of HawkLlama.
☆16 · Jan 2, 2025 · Updated last year
aim-uofa / TVRBench
TVRBench: Target Viewpoint Reproduction Benchmark for Active Spatial Intelligence
☆25 · Jun 2, 2026 · Updated last month
aim-uofa / Tinker
One-shot and Few-shot 3D Editing without Per-Scene Optimization
☆175 · Aug 21, 2025 · Updated 11 months ago
aim-uofa / COSINE
[ICCV'25] Unified Open-World Segmentation with Multi-Modal Prompts
☆16 · Jun 16, 2026 · Updated last month
FoundationVision / InfinityStar
[NeurIPS 2025 Oral] Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation
☆773 · Apr 16, 2026 · Updated 3 months ago
fudoki-hku / FUDOKI
[NeurIPS 2025 Spotlight] FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities
☆77 · Dec 21, 2025 · Updated 7 months ago
aim-uofa / SINE
[NeurIPS'24] A Simple Image Segmentation Framework via In-Context Examples
☆68 · Oct 29, 2024 · Updated last year
aim-uofa / ReasonMatch
[CVPR2026] Eliciting Complex Spatial Reasoning in MLLMs through Wide-Baseline Matching
☆19 · Jun 4, 2026 · Updated last month
aim-uofa / Active-o3
[ICML2026] ACTIVE-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO
☆83 · Apr 30, 2026 · Updated 2 months ago
JIA-Lab-research / Jenga
[NeurIPS 2025] Training-Free Efficient Video Generation via Dynamic Token Carving
☆287 · Aug 4, 2025 · Updated 11 months ago
QingZhong1996 / Awesome-Video-Instance-Segmentation-Papers
☆36 · Oct 21, 2022 · Updated 3 years ago
vita-epfl / LayerSync
[ICLR 2026] LayerSync: Self-aligning Intermediate Layers
☆22 · Mar 21, 2026 · Updated 4 months ago
Visual-AI / JoVA
JoVA: Unified Multimodal Learning for Joint Video-Audio Generation
☆33 · Dec 22, 2025 · Updated 7 months ago
TencentARC / RollingForcing
[ICLR 2026] Official Repo for Rolling Forcing: Autoregressive Long Video Diffusion in Real Time
☆444 · Oct 31, 2025 · Updated 8 months ago
aim-uofa / DiffewS
[NeurIPS'24] Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation (Diffews)
☆51 · Apr 14, 2025 · Updated last year
NVlabs / DiffusionNFT
[ICLR 2026 Oral] DiffusionNFT: Online Diffusion Reinforcement with Forward Process
☆983 · Feb 10, 2026 · Updated 5 months ago
baaivision / DenseFusion
DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception
☆159 · Dec 6, 2024 · Updated last year
Eyeline-Labs / VChain
[ACL 2026 Findings, ICCV 2025 Workshop Outstanding Paper Award] VChain: Chain-of-Visual-Thought for Reasoning in Video Generation
☆120 · Apr 8, 2026 · Updated 3 months ago
JaydenLyh / Reward-Forcing
[CVPR 2026 Highlight] Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation
☆352 · Dec 15, 2025 · Updated 7 months ago
ZitengWangNYU / Scale-RAE
Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders
☆255 · Feb 13, 2026 · Updated 5 months ago
arielshaulov / TokenTrim
Official implementation of the paper "TOKENTRIM: INFERENCE-TIME TOKEN PRUNING FOR AUTOREGRESSIVE LONG VIDEO GENERATION"
☆15 · Feb 8, 2026 · Updated 5 months ago
EnVision-Research / MTI
[ACL 2026] Official implementation of "Less is More: Improving LLM Reasoning with Minimal Test-Time Intervention"
☆41 · Apr 18, 2026 · Updated 3 months ago
yifan123 / flow_grpo
[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL
☆2,430 · May 7, 2026 · Updated 2 months ago
aim-uofa / AGILE
☆47 · May 6, 2026 · Updated 2 months ago
WeichenFan / UAE
Official repo for UAE
☆207 · Jun 21, 2026 · Updated last month
tianweiy / CausVid
(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models
☆1,408 · Aug 7, 2025 · Updated 11 months ago
apple / ml-atoken
☆145 · Nov 8, 2025 · Updated 8 months ago
aim-uofa / GenDeF
☆39 · Mar 5, 2026 · Updated 4 months ago