AIDC-AI / Ovis-ImageLinks
Ovis-Image is a 7B text-to-image model specifically optimized for high-quality text rendering, designed to operate efficiently under stringent computational constraints.
☆144Updated this week
Alternatives and similar repositories for Ovis-Image
Users that are interested in Ovis-Image are comparing it to the libraries listed below
Sorting:
- ☆227Updated 4 months ago
- ☆95Updated 3 weeks ago
- project for skyreels-a3☆78Updated 3 months ago
- ☆315Updated 2 months ago
- Lynx: Towards High-Fidelity Personalized Video Generation☆288Updated 2 months ago
- [ArXiv 25] Stable Video Infinity: Infinite-Length Video Generation with Error Recycling☆556Updated last month
- [ICCV 2025] Enhancing spatial understanding in text-to-Image diffusion models☆90Updated 2 months ago
- [SIGGRAPH Asia 25] Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off☆323Updated last month
- High-Quality Text-to-Video Generation with Alpha Channel☆289Updated 2 months ago
- [Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset☆509Updated last month
- Calligrapher: Freestyle Text Image Customization☆294Updated 3 months ago
- ☆166Updated 3 weeks ago
- ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation☆614Updated 2 weeks ago
- ☆112Updated 7 months ago
- HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation☆659Updated last month
- Official code for AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset☆269Updated 5 months ago
- 🎨 A Style is Worth One Code: Unlocking Code-to-Style Image Generation with Discrete Style Space☆146Updated this week
- iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation☆162Updated this week
- Code for CineScale, higher-resolution video generation based on Wan☆177Updated 3 months ago
- Official Repository of "OmniTry: Virtual Try-On Anything without Masks"☆231Updated 3 months ago
- FIBO is a SOTA, first open-source, JSON-native text-to-image model built for controllable, predictable, and legally safe image generation…☆280Updated last week
- 🔥🔥 Official Repo of UMO: Scaling Multi-Identity Consistency for Image Customization via Matching Reward☆169Updated 2 months ago
- Official implementation of DisEnvisioner: Disentangled and Enriched Visual Prompt for Customized Image Generation☆118Updated 10 months ago
- Text and image to video generation: Kandinsky 4.0 (2024)☆149Updated 11 months ago
- Official implementation of "Normalized Attention Guidance"☆174Updated 5 months ago
- Pusa: Thousands Timesteps Video Diffusion Model☆665Updated 3 months ago
- [SIGGRAPH 2025] Official code of the paper "Cobra: Efficient Line Art COlorization with BRoAder References". Cobra:利用更广泛参考图实现高效线稿上色☆230Updated 7 months ago
- SynCD: Generating Multi-Image Synthetic Data for Text-to-Image Customization (ICCV 2025)☆148Updated last month
- Official implementation for "Story2Board: A Training‑Free Approach for Expressive Storyboard Generation"☆179Updated 3 months ago
- MoviiGen 1.1: Towards Cinematic-Quality Video Generative Models☆179Updated 4 months ago