A Unified Visual Generator with Interleaved OmniModal Context
☆185Feb 10, 2026Updated 2 weeks ago
Alternatives and similar repositories for VINO-code
Users that are interested in VINO-code are comparing it to the libraries listed below
- ☆86Feb 4, 2026Updated 3 weeks ago
- DreamStyle: A Unified Framework for Video Stylization☆109Jan 7, 2026Updated last month
- Official Implementation of ReCo: Region-Constraint In-Context Generation for Instructional Video Editing☆144Updated this week
- ☆13Jul 10, 2024Updated last year
- OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models☆154Sep 24, 2025Updated 5 months ago
- Scaling Zero-Shot Reference-to-Video Generation☆62Dec 11, 2025Updated 2 months ago
- https://little-misfit.github.io/GRAG-Image-Editing/☆115Nov 27, 2025Updated 3 months ago
- [ICLR 2026] UniVideo: Unified Understanding, Generation, and Editing for Videos☆431Feb 11, 2026Updated 2 weeks ago
- [CVPR 2026] VideoCoF: Unified Video Editing with Temporal Reasoner☆142Feb 22, 2026Updated last week
- A real-time streaming conversational video system that transforms text interactions into continuous, high-fidelity video responses using …☆302Dec 15, 2025Updated 2 months ago
- Official Implementation for *PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling*☆32Dec 13, 2025Updated 2 months ago
- ☆20Jun 26, 2024Updated last year
- VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning☆59Nov 4, 2025Updated 3 months ago
- Official implementation of the paper "Delving into Latent Spectral Biasing of Video VAEs for Superior Diffusability"☆46Dec 25, 2025Updated 2 months ago
- ☆197Feb 3, 2026Updated 3 weeks ago
- ☆130Dec 24, 2025Updated 2 months ago
- [NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation"☆69Updated this week
- This project is the official implementation of 'DreamOmni3: Scribble-based Editing and Generation'☆38Dec 30, 2025Updated 2 months ago
- ☆187Dec 10, 2025Updated 2 months ago
- Official implementation of "VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis"☆20Jan 26, 2025Updated last year
- Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.☆16Feb 4, 2024Updated 2 years ago
- [CVPR 2026] An official implementation of "Think Visually, Reason Textually: Vision-Language Synergy in ARC"☆37Nov 26, 2025Updated 3 months ago
- Official implementation of the paper "Bind-Your-Avatar: Multi-Talking-Character Video Generation with Dynamic 3D-mask-based Embedding Rou…☆34Sep 25, 2025Updated 5 months ago
- End2End Virtual Try-on with Visual Reference, CVPR2026☆58Nov 19, 2025Updated 3 months ago
- Inferix: A Block-Diffusion based Next-Generation Inference Engine for World Simulation☆110Updated this week
- MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head (ICLR 2026)☆123Feb 6, 2026Updated 3 weeks ago
- Animate Any Character in Any World☆90Jan 9, 2026Updated last month
- Benchmark dataset and code of MSRVTT-Personalization☆52Nov 10, 2025Updated 3 months ago
- [CVPR 2026] We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length vide…☆446Feb 21, 2026Updated last week
- [NeurIPS 2025 Oral] Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation☆721Nov 27, 2025Updated 3 months ago
- ☆53Dec 10, 2025Updated 2 months ago
- ☆296Feb 9, 2026Updated 3 weeks ago
- [CVPR 2026] OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer☆219Feb 21, 2026Updated last week
- ☆115Updated this week
- DeepVerse: 4D Autoregressive Video Generation as a World Model☆216Aug 11, 2025Updated 6 months ago
- ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation☆114Dec 11, 2025Updated 2 months ago
- [AAAI 2026] UltraGen☆77Feb 1, 2026Updated last month
- Pusa: Thousands Timesteps Video Diffusion Model☆671Feb 13, 2026Updated 2 weeks ago
- [CVPR 2026] SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time☆99Jan 1, 2026Updated 2 months ago