ATH-MaaS/Ovis-Image

Ovis-Image is a 7B text-to-image model specifically optimized for high-quality text rendering, designed to operate efficiently under stringent computational constraints.

☆319

Alternatives and similar repositories for Ovis-Image

Users who are interested in Ovis-Image are comparing it to the repositories listed below.

ATH-MaaS / Ovis-U1
A unified model that seamlessly integrates multimodal understanding, text-to-image generation, and image editing within a single powerfu…
☆450 · Dec 2, 2025 · Updated 7 months ago
meituan-longcat / LongCat-Image
☆709 · May 9, 2026 · Updated 2 months ago
ATH-MaaS / Wings
The code repository for "Wings: Learning Multimodal LLMs without Text-only Forgetting" [NeurIPS 2024]
☆27 · Dec 28, 2024 · Updated last year
Francis-Rings / FlashPortrait
[CVPR 2026] We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length vide…
☆479 · Feb 21, 2026 · Updated 5 months ago
Tencent-Hunyuan / HunyuanImage-2.1
HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation
☆673 · Oct 14, 2025 · Updated 9 months ago
kandinskylab / kandinsky-5
Kandinsky 5.0: A family of diffusion models for Video & Image generation
☆799 · Jul 5, 2026 · Updated 2 weeks ago
nv-tlabs / ChronoEdit
[ICLR 2026] ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation
☆697 · Nov 20, 2025 · Updated 8 months ago
X-Omni-Team / X-Omni
Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).
☆426 · Aug 26, 2025 · Updated 10 months ago
zai-org / GLM-Image
GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image Generation.
☆998 · Mar 20, 2026 · Updated 4 months ago
LemonSky1995 / DreamStyle
DreamStyle: A Unified Framework for Video Stylization
☆124 · Jan 7, 2026 · Updated 6 months ago
FireRedTeam / FireRed-Image-Edit
FireRed-Image-Edit is a powerful image editing foundation model achieving open-source state-of-the-art performance with precise instructi…
☆1,319 · Apr 3, 2026 · Updated 3 months ago
OPPO-Mente-Lab / Qwen-Image-Pruning
CVPR 2026 Highlight: Pluggable Pruning with Contiguous Layer Distillation for Diffusion Transformers
☆86 · Apr 9, 2026 · Updated 3 months ago
baaivision / Emu3.5
Native Multimodal Models are World Learners
☆1,537 · Dec 30, 2025 · Updated 6 months ago
MCG-NJU / SteadyDancer
SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation
☆636 · Dec 23, 2025 · Updated 7 months ago
ModelTC / LightX2V-Qwen-Image-Lightning
Qwen-Image-Lightning: Speed up Qwen-Image model with distillation
☆1,340 · Jan 1, 2026 · Updated 6 months ago
little-misfit / GRAG-Image-Editing
https://little-misfit.github.io/GRAG-Image-Editing/
☆119 · Nov 27, 2025 · Updated 7 months ago
inclusionAI / TwinFlow
[ICLR 2026] Taming large-scale few-step training with self-adversarial flows! 👏🏻
☆536 · Feb 24, 2026 · Updated 5 months ago
stepfun-ai / NextStep-1
[🚀 ICLR 2026 Oral] NextStep-1: SOTA Autoregressive Image Generation with Continuous Tokens. A research project developed by the StepFun’s …
☆690 · Feb 27, 2026 · Updated 4 months ago
SOTAMak1r / VINO-code
A Unified Visual Generator with Interleaved OmniModal Context
☆232 · Mar 5, 2026 · Updated 4 months ago
WeChatCV / Stand-In
[CVPR 2026 🎉] Stand-In is a lightweight, plug-and-play framework for identity-preserving video generation.
☆777 · Feb 21, 2026 · Updated 5 months ago
sjtuplayer / UltraGen
[AAAI 2026] UltraGen
☆77 · Feb 1, 2026 · Updated 5 months ago
QwenLM / Qwen-Image-Layered
Qwen-Image-Layered: Layered Decomposition for Inherent Editability
☆2,025 · Dec 31, 2025 · Updated 6 months ago
TencentYoutuResearch / T2I-L2P
Code for "L2P: Unlocking Latent Potential for Pixel Generation"
☆179 · Jul 11, 2026 · Updated 2 weeks ago
yejy53 / RealGen
[ECCV 2026] RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards.
☆336 · Mar 13, 2026 · Updated 4 months ago
HiDream-ai / ReCo
[ICML 2026] ReCo: In-Context Generation with Regional Constraints for Instructional Video Editing
☆170 · May 26, 2026 · Updated last month
AMD-AGI / Nitro-E
Nitro-E is a family of text-to-image diffusion models focused on highly efficient training.
☆125 · Jun 4, 2026 · Updated last month
JiazheWei / PosterCopilot
☆198 · Dec 10, 2025 · Updated 7 months ago
zai-org / SCAIL
SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations (CVPR 2026 Findings)
☆1,024 · May 6, 2026 · Updated 2 months ago
Tencent-Hunyuan / HunyuanImage-3.0
HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation
☆3,199 · Jun 23, 2026 · Updated last month
Songlin1998 / ShotVerse
☆103 · Mar 13, 2026 · Updated 4 months ago
Kr1sJFU / iMontage
iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation
☆188 · Dec 1, 2025 · Updated 7 months ago
baidu / ERNIE-Image
ERNIE-Image is an open text-to-image generation model developed by the ERNIE-Image team at Baidu. It is built on a single-stream Diffusio…
☆493 · Apr 17, 2026 · Updated 3 months ago
ssj9596 / One-to-All-Animation
[CVPR 2026 Poster] One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer
☆490 · Apr 19, 2026 · Updated 3 months ago
Kevin-thu / StoryMem
Official code for StoryMem: Multi-shot Long Video Storytelling with Memory
☆760 · Updated this week
PKU-YuanGroup / Edit-R1
Edit-R1: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback
☆295 · Jan 24, 2026 · Updated 6 months ago
EzioBy / Calligrapher
Calligrapher: Freestyle Text Image Customization
☆297 · Sep 3, 2025 · Updated 10 months ago
QwenLM / Qwen-Image
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
☆8,164 · Feb 10, 2026 · Updated 5 months ago
EzioBy / Ditto
[CVPR'26 Highlight] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset
☆617 · Jun 1, 2026 · Updated last month
Tongyi-MAI / Z-Image
☆11,786 · Feb 9, 2026 · Updated 5 months ago