AIDC-AI / Ovis-ImageLinks
Ovis-Image is a 7B text-to-image model specifically optimized for high-quality text rendering, designed to operate efficiently under stringent computational constraints.
☆304Updated last month
Alternatives and similar repositories for Ovis-Image
Users that are interested in Ovis-Image are comparing it to the libraries listed below
Sorting:
- Lynx: Towards High-Fidelity Personalized Video Generation☆308Updated 4 months ago
- ☆328Updated 4 months ago
- ☆183Updated 2 months ago
- SpotEdit:Selective Region Editing in Diffusion Transformers☆171Updated last month
- [ICLR 2026] ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation☆668Updated 2 months ago
- project for skyreels-a3☆78Updated 6 months ago
- High-Quality Text-to-Video Generation with Alpha Channel☆329Updated last month
- Calligrapher: Freestyle Text Image Customization☆295Updated 5 months ago
- One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer☆436Updated last month
- We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length videos while a…☆434Updated last month
- FIBO is a SOTA, first open-source, JSON-native text-to-image model built for controllable, predictable, and legally safe image generation…☆304Updated last month
- Official implementation for "DyPE: Dynamic Position Extrapolation for Ultra High Resolution Diffusion".☆337Updated 2 months ago
- Code for CineScale, higher-resolution video generation based on Wan☆183Updated 5 months ago
- ☆100Updated 3 months ago
- A real-time streaming conversational video system that transforms text interactions into continuous, high-fidelity video responses using …☆293Updated last month
- Mobius: Text to Seamless Looping Video Generation via Latent Shift☆172Updated 9 months ago
- 🔥🔥 Official Repo of UMO: Scaling Multi-Identity Consistency for Image Customization via Matching Reward☆179Updated 4 months ago
- [Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset☆566Updated 3 months ago
- ☆227Updated 6 months ago
- Official implementation of MAGREF: Masked Guidance for Any-Reference Video Generation with Subject Disentanglement☆285Updated 3 weeks ago
- HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation☆672Updated 3 months ago
- Official code for AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset☆271Updated 8 months ago
- ☆175Updated 3 months ago
- In-context subject-driven image generation while preserving foreground fidelity☆351Updated 8 months ago
- MoviiGen 1.1: Towards Cinematic-Quality Video Generative Models☆184Updated 6 months ago
- Pusa: Thousands Timesteps Video Diffusion Model☆672Updated this week
- The official code implementation of the paper "OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data."☆425Updated 8 months ago
- Official implementation for "Story2Board: A Training‑Free Approach for Expressive Storyboard Generation"☆229Updated 5 months ago
- [SIGGRAPH Asia 25] Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off☆333Updated 3 months ago
- Official implementation of "VideoMaMa: Mask-Guided Video Matting via Generative Prior"☆222Updated this week