A Unified Visual Generator with Interleaved OmniModal Context
☆202Mar 5, 2026Updated 2 weeks ago
Alternatives and similar repositories for VINO-code
Users that are interested in VINO-code are comparing it to the libraries listed below
Sorting:
- ☆86Feb 4, 2026Updated last month
- DreamStyle: A Unified Framework for Video Stylization☆113Jan 7, 2026Updated 2 months ago
- Official Implementation of ReCo: Region-Constraint In-Context Generation for Instructional Video Editing☆150Mar 5, 2026Updated 2 weeks ago
- [ICLR 2026] UniVideo: Unified Understanding, Generation, and Editing for Videos☆467Feb 11, 2026Updated last month
- https://little-misfit.github.io/GRAG-Image-Editing/☆116Nov 27, 2025Updated 3 months ago
- ☆13Jul 10, 2024Updated last year
- Scaling Zero-Shot Reference-to-Video Generation☆64Dec 11, 2025Updated 3 months ago
- [CVPR 2026] VideoCoF: Unified Video Editing with Temporal Reasoner☆158Feb 22, 2026Updated last month
- DeepVerse: 4D Autoregressive Video Generation as a World Model☆219Aug 11, 2025Updated 7 months ago
- [CVPR2026]We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length vide…☆459Feb 21, 2026Updated last month
- ☆209Mar 9, 2026Updated last week
- A real-time streaming conversational video system that transforms text interactions into continuous, high-fidelity video responses using …☆311Dec 15, 2025Updated 3 months ago
- End2End Virtual Try-on with Visual Reference, CVPR2026☆58Updated this week
- Official repository of paper "ProEdit: Inversion-based Editing From Prompts Done Right"☆116Feb 5, 2026Updated last month
- ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation☆114Dec 11, 2025Updated 3 months ago
- OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models☆156Mar 4, 2026Updated 2 weeks ago
- [ICLR 2025] Where Am I and What Will I See : An Auto-Regressive Model for Spatial Localization and View Prediction☆44Aug 9, 2025Updated 7 months ago
- [NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation"☆69Feb 26, 2026Updated 3 weeks ago
- [CVPR 2026] OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer☆225Feb 21, 2026Updated last month
- Infinite-Forcing: Towards Infinite-Long Video Generation☆139Nov 13, 2025Updated 4 months ago
- Official implementation of the paper "Bind-Your-Avatar: Multi-Talking-Character Video Generation with Dynamic 3D-mask-based Embedding Rou…☆34Sep 25, 2025Updated 5 months ago
- Code repo for EffectMaker: Unifying Reasoning and Generation for Customized Visual Effect Creation☆34Mar 6, 2026Updated 2 weeks ago
- [AAAI 2026] UltraGen☆77Feb 1, 2026Updated last month
- Unlimited-length talking video generation that supports image-to-video and video-to-video generation☆133Aug 17, 2025Updated 7 months ago
- Animate Any Character in Any World☆96Mar 10, 2026Updated last week
- ☆191Dec 10, 2025Updated 3 months ago
- StreetSurfGS: Scalable Large Scene Surface Reconstruction with Gaussian Splatting for Urban Street Scences☆22Jun 12, 2024Updated last year
- ☆82Oct 13, 2025Updated 5 months ago
- This project is the official implementation of 'DreamOmni3: Scribble-based Editing and Generation''☆39Dec 30, 2025Updated 2 months ago
- CoV: Chain-of-View Prompting for Spatial Reasoning☆52Jan 23, 2026Updated last month
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model☆13Dec 29, 2024Updated last year
- ☆53Dec 10, 2025Updated 3 months ago
- [CVPR'26] VecGlypher: Unified Vector Glyph Generation with Language Models☆104Feb 26, 2026Updated 3 weeks ago
- Benchmark dataset and code of MSRVTT-Personalization☆51Nov 10, 2025Updated 4 months ago
- Pusa: Thousands Timesteps Video Diffusion Model☆674Feb 13, 2026Updated last month
- Generative Omnimatte (CVPR 2025)☆168Jun 3, 2025Updated 9 months ago
- ☆322Feb 9, 2026Updated last month
- ☆130Feb 28, 2026Updated 3 weeks ago
- Code2Worlds: Empowering Coding LLMs for 4D World Generation☆91Feb 26, 2026Updated 3 weeks ago