AIDC-AI / Ovis-ImageLinks
Ovis-Image is a 7B text-to-image model specifically optimized for high-quality text rendering, designed to operate efficiently under stringent computational constraints.
☆304Updated last month
Alternatives and similar repositories for Ovis-Image
Users that are interested in Ovis-Image are comparing it to the libraries listed below
Sorting:
- Lynx: Towards High-Fidelity Personalized Video Generation☆308Updated 4 months ago
- [ICLR 2026] ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation☆668Updated 2 months ago
- ☆183Updated 2 months ago
- We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length videos while a…☆434Updated last month
- ☆328Updated 4 months ago
- Official implementation for "DyPE: Dynamic Position Extrapolation for Ultra High Resolution Diffusion".☆337Updated 2 months ago
- High-Quality Text-to-Video Generation with Alpha Channel☆329Updated last month
- Code for CineScale, higher-resolution video generation based on Wan☆183Updated 5 months ago
- project for skyreels-a3☆78Updated 6 months ago
- SpotEdit:Selective Region Editing in Diffusion Transformers☆171Updated last month
- One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer☆436Updated last month
- Calligrapher: Freestyle Text Image Customization☆296Updated 5 months ago
- Official implementation for "Story2Board: A Training‑Free Approach for Expressive Storyboard Generation"☆229Updated 5 months ago
- A real-time streaming conversational video system that transforms text interactions into continuous, high-fidelity video responses using …☆297Updated last month
- [SIGGRAPH Asia 25] Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off☆333Updated 3 months ago
- 🔥🔥 Official Repo of UMO: Scaling Multi-Identity Consistency for Image Customization via Matching Reward☆179Updated 4 months ago
- ☆100Updated 3 months ago
- Official implementation of MAGREF: Masked Guidance for Any-Reference Video Generation with Subject Disentanglement☆285Updated last month
- Mobius: Text to Seamless Looping Video Generation via Latent Shift☆172Updated 9 months ago
- [Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset☆566Updated 3 months ago
- ☆227Updated 6 months ago
- 🎨 A Style is Worth One Code: Unlocking Code-to-Style Image Generation with Discrete Style Space☆154Updated 2 months ago
- ☆175Updated 3 months ago
- MoviiGen 1.1: Towards Cinematic-Quality Video Generative Models☆184Updated 6 months ago
- [ICCV 2025] Enhancing spatial understanding in text-to-Image diffusion models☆89Updated 5 months ago
- HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation☆672Updated 3 months ago
- Pusa: Thousands Timesteps Video Diffusion Model☆672Updated this week
- FIBO is a SOTA, first open-source, JSON-native text-to-image model built for controllable, predictable, and legally safe image generation…☆304Updated last month
- iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation☆185Updated 2 months ago
- ☆85Updated 2 months ago