AIDC-AI / Ovis-ImageLinks
Ovis-Image is a 7B text-to-image model specifically optimized for high-quality text rendering, designed to operate efficiently under stringent computational constraints.
☆279Updated this week
Alternatives and similar repositories for Ovis-Image
Users that are interested in Ovis-Image are comparing it to the libraries listed below
Sorting:
- ☆96Updated last month
- Calligrapher: Freestyle Text Image Customization☆295Updated 3 months ago
- ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation☆633Updated last month
- ☆166Updated 2 weeks ago
- ☆316Updated 3 months ago
- Official implementation of MAGREF: Masked Guidance for Any-Reference Video Generation with Subject Disentanglement☆281Updated 2 months ago
- Code for CineScale, higher-resolution video generation based on Wan☆181Updated 4 months ago
- High-Quality Text-to-Video Generation with Alpha Channel☆307Updated last week
- 🔥🔥 Official Repo of UMO: Scaling Multi-Identity Consistency for Image Customization via Matching Reward☆176Updated 3 months ago
- Official implementation for "DyPE: Dynamic Position Extrapolation for Ultra High Resolution Diffusion".☆318Updated last month
- Mobius: Text to Seamless Looping Video Generation via Latent Shift☆168Updated 7 months ago
- [SIGGRAPH Asia 25] Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off☆331Updated 2 months ago
- ☆227Updated 5 months ago
- [SIGGRAPH 2025] Official code of the paper "Cobra: Efficient Line Art COlorization with BRoAder References". Cobra:利用更广泛参考图实现高效线稿上色☆233Updated 2 weeks ago
- [Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset☆539Updated last month
- MoviiGen 1.1: Towards Cinematic-Quality Video Generative Models☆181Updated 5 months ago
- Lynx: Towards High-Fidelity Personalized Video Generation☆296Updated 2 months ago
- The official code implementation of the paper "OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data."☆417Updated 6 months ago
- ☆112Updated 8 months ago
- We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length videos while a…☆83Updated this week
- Official implementation of "Normalized Attention Guidance"☆175Updated 5 months ago
- Official code for AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset☆270Updated 6 months ago
- We achieves high-quality first-frame guided video editing given a reference image, while maintaining flexibility for incorporating additi…☆318Updated 4 months ago
- [ICCV 2025] Enhancing spatial understanding in text-to-Image diffusion models☆90Updated 3 months ago
- UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios☆85Updated last week
- One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer☆392Updated 2 weeks ago
- Pusa: Thousands Timesteps Video Diffusion Model☆668Updated 3 months ago
- project for skyreels-a3☆78Updated 4 months ago
- FIBO is a SOTA, first open-source, JSON-native text-to-image model built for controllable, predictable, and legally safe image generation…☆287Updated 3 weeks ago
- SynCD: Generating Multi-Image Synthetic Data for Text-to-Image Customization (ICCV 2025)☆150Updated 2 months ago