hzphzp/WeGen

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hzphzp/WeGen)

hzphzp / WeGen

☆27

Alternatives and similar repositories for WeGen

Users that are interested in WeGen are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Mowenyii / Uniform-Attention-Maps
View on GitHub
[WACV 2025] Uniform Attention Maps: Enhancing Image Fidelity in Reconstruction and Editing
☆17Mar 16, 2025Updated last year
showlab / DoraCycle
View on GitHub
DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal Cycles
☆31Mar 8, 2026Updated 4 months ago
OPPO-Mente-Lab / X2I
View on GitHub
Official code for ICCV 2025 paper, X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distill…
☆89Jun 26, 2025Updated last year
zhangguiwei610 / V2Flow
View on GitHub
☆29Mar 30, 2025Updated last year
UCSC-VLAA / Complex-Edit
View on GitHub
Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark
☆29Apr 22, 2025Updated last year
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
zhangzef / COOPER
View on GitHub
The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.
☆38Jul 1, 2026Updated 3 weeks ago
rongyaofang / GoT
View on GitHub
Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing"
☆317Sep 28, 2025Updated 9 months ago
JIA-Lab-research / RePlan
View on GitHub
(ECCV2026) RePlan: Reasoning-Guided Region Planning for Complex Instruction-Based Image Editing
☆67Jul 1, 2026Updated 3 weeks ago
InternLM / Spark
View on GitHub
An official implementation of "SPARK: Synergistic Policy And Reward Co-Evolving Framework"
☆25Oct 23, 2025Updated 9 months ago
Mowenyii / PAE
View on GitHub
[CVPR 2024] Dynamic Prompt Optimizing for Text-to-Image Generation
☆87Jul 13, 2024Updated 2 years ago
WeChatCV / UnicBench
View on GitHub
[CVPR 2026] UnicEdit-10M and UnicBench project
☆42Mar 3, 2026Updated 4 months ago
hutaiHang / ATM
View on GitHub
[ICCV 2025] Official implementation of "Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing"
☆28Apr 15, 2025Updated last year
Tencent / HaploVLM
View on GitHub
ICML2025
☆63Aug 28, 2025Updated 10 months ago
tyshiwo1 / Accelerating-T2I-AR-with-SJD
View on GitHub
[ICLR 2025] Implementation of Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding
☆52Apr 21, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
InternLM / ETCHR
View on GitHub
A question-conditioned, reasoning-aware image editor designed to serve as a decoupled visual reasoning assistant for Multimodal Large Lan…
☆23May 25, 2026Updated last month
FreqEdit / FreqEdit
View on GitHub
[CVPR2026] Official implementation of "FreqEdit: Preserving High-Frequency Features for Robust Multi-Turn Image Editing"
☆15Mar 31, 2026Updated 3 months ago
ATH-MaaS / Ovis-U1
View on GitHub
An unified model that seamlessly integrates multimodal understanding, text-to-image generation, and image editing within a single powerfu…
☆450Dec 2, 2025Updated 7 months ago
steb6 / ISBFSAR
View on GitHub
Interactive Skeleton Based Few Shot Action Recognition
☆14Nov 8, 2022Updated 3 years ago
TencentARC / MindOmni
View on GitHub
[NeurIPS2025] The official implementation of MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO
☆139Oct 15, 2025Updated 9 months ago
lyrig / TokenAR
View on GitHub
TokenAR: Multiple Subject Generation via Autoregressive Token-level enhancement
☆22Mar 4, 2026Updated 4 months ago
moatifbutt / color-peel
View on GitHub
we propose to generate a series of geometric shapes with target colors to disentangle (or peel off ) the target colors from the shapes. B…
☆67Oct 7, 2024Updated last year
Eureka-Maggie / MIGE
View on GitHub
Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing
☆72Jul 13, 2025Updated last year
NTUYWANG103 / MovingColor
View on GitHub
[ACM MM 2024 (Oral)] Official PyTorch Implementation of Paper "MovingColor: Seamless Fusion of Fine-grained Video Color Enhancement"
☆12Dec 30, 2024Updated last year
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
hutaiHang / ToMe
View on GitHub
[NeurIPS 2024] Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis
☆86Feb 3, 2025Updated last year
Maplebb / UniREditBench
View on GitHub
[ECCV 2026] Offline implementation of UniREditBench: A Unified Reasoning-based Image Editing Benchmark.
☆58Jun 21, 2026Updated last month
Bujiazi / HiFlow
View on GitHub
[NeurIPS 2025] Official implementation of HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance
☆88Sep 18, 2025Updated 10 months ago
qiujihao19 / LongVideo-R1
View on GitHub
[CVPR 2026] LongVideo-R1: Smart Navigation for Low-cost Long Video Understanding
☆50Jul 7, 2026Updated 2 weeks ago
Jolieresearch / ICPF
View on GitHub
☆14Nov 26, 2025Updated 7 months ago
TiantianWang / ICCV17_SRM
View on GitHub
☆22Aug 1, 2018Updated 7 years ago
cyfml / OPSTL
View on GitHub
OPSTL: Self-supervised Skeleton-based Action Recognition in Occluded Environments
☆14Oct 25, 2023Updated 2 years ago
SitongGong / Veason-R1
View on GitHub
Official code of Veason-R1
☆15Jul 14, 2026Updated last week
bcmi / RETAB-Weak-Shot-Semantic-Segmentation
View on GitHub
Official Implementation for Weak-shot Semantic Segmentation by Transferring Semantic Affinity and Boundary (BMVC 2022)
☆23Apr 8, 2023Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
ZiyuGuo99 / Image-Generation-CoT
View on GitHub
[CVPR 2025] The First Investigation of CoT Reasoning (RL, TTS, Reflection) in Image Generation
☆865Mar 19, 2026Updated 4 months ago
zhentao-zou / MURE
View on GitHub
Beyond Textual CoT: Interleaved Text-image chains with Deep Confidence Reasoning for Image Editing
☆19Jun 24, 2026Updated last month
Character-Adapter / Character-Adapter
View on GitHub
☆63Jul 3, 2024Updated 2 years ago
bcmi / ProPIH-Painterly-Image-Harmonization
View on GitHub
[AAAI2024] Progressive Painterly Image Harmonization from Low-level Styles to High-level Styles
☆25Feb 24, 2026Updated 5 months ago
weichow23 / EditMGT
View on GitHub
Official Repo for Paper <EditMGT Unleashing the Potential of Masked Generative Transformer in Image Editing>
☆79Dec 20, 2025Updated 7 months ago
Karine-Huang / T2I-CompBench
View on GitHub
[Neurips 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluation
☆345May 7, 2026Updated 2 months ago
zhuangshaobin / WeTok
View on GitHub
[ICLR2026] WeTok: Powerful Discrete Tokenization for High-Fidelity Visual Reconstruction
☆69Sep 3, 2025Updated 10 months ago