AIDC-AI / Ovis-U1Links
An unified model that seamlessly integrates multimodal understanding, text-to-image generation, and image editing within a single powerful framework.
☆338Updated this week
Alternatives and similar repositories for Ovis-U1
Users that are interested in Ovis-U1 are comparing it to the libraries listed below
Sorting:
- UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation☆629Updated last week
- [ICCV 2025] Video-T1: Test-Time Scaling for Video Generation☆278Updated 2 weeks ago
- [ICCV 2025] Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning☆168Updated 2 weeks ago
- [ICLR 2025] IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation☆192Updated 4 months ago
- ☆229Updated last month
- [ICLR 2025] Official Implementation of Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image…☆320Updated this week
- [ICLR2025] A versatile image-to-image visual assistant, designed for image generation, manipulation, and translation based on free-from u…☆205Updated 2 months ago
- Official implementation for KV-Edit: Training-Free Image Editing for Precise Background Preservation☆316Updated last month
- [ICCV 2025] VisualCloze: A universal image generation framework that can support a wide range of in-domain tasks and generalize to unseen…☆246Updated last month
- FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation☆436Updated 4 months ago
- Code repository for T2V-Turbo and T2V-Turbo-v2☆302Updated 5 months ago
- [ICLR 2025] VideoGrain: This repo is the official implementation of "VideoGrain: Modulating Space-Time Attention for Multi-Grained Video …☆139Updated 3 months ago
- Official Implementation: Training-Free Efficient Video Generation via Dynamic Token Carving☆214Updated 2 weeks ago
- VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning☆258Updated 2 months ago
- [ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences☆311Updated 11 months ago
- Pusa: Thousands Timesteps Video Diffusion Model☆200Updated 3 weeks ago
- ☆111Updated 3 weeks ago
- Personalize Anything for Free with Diffusion Transformer☆334Updated 3 months ago
- Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".☆207Updated 3 months ago
- 🔥 Official impl. of "DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction"☆122Updated 2 weeks ago
- [ICCV 2025] Code & Data for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing☆153Updated 2 weeks ago
- Repo for SeedVR2 & SeedVR (CVPR2025 Highlight)☆308Updated last week
- Let's finetune video generation models!☆486Updated 2 months ago
- An Efficient Text-to-Image Generation Pretrain Pipeline☆109Updated 2 months ago
- ☆174Updated this week
- [ICCV 2025] Code for FreeScale, a tuning-free method for higher-resolution visual generation☆126Updated 4 months ago
- ☆541Updated 7 months ago
- ImgEdit: A Unified Image Editing Dataset and Benchmark☆138Updated last week
- [ICLR 2025] FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality☆234Updated 6 months ago
- [CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization☆234Updated 3 months ago