PicoTrex / Awesome-GPT-Image-generationLinks

☆12

Alternatives and similar repositories for Awesome-GPT-Image-generation

Users that are interested in Awesome-GPT-Image-generation are comparing it to the libraries listed below

Sorting:

PhoenixZ810 / RISEBench
Official Repository of paper: Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing
☆87Updated this week
VisionXLab / LRS-VQA
[ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning
☆33Updated 3 weeks ago
HVision-NKU / MaskDiffusion
☆11Updated 8 months ago
mair-lab / EARL
EARL: Editing with Autoregression and RL
☆23Updated 3 weeks ago
xuliu-cyber / RSUniVLM
☆32Updated 8 months ago
tiiuae / FineLIP
code for FineLIP
☆28Updated 5 months ago
MiracleDance / CAR
CAR: Controllable AutoRegressive Modeling for Visual Generation
☆122Updated 9 months ago
zeyuwang-zju / DiffX
Official code for "DiffX: Guide Your Layout to Cross-Modal Generative Modeling"
☆23Updated 6 months ago
HVision-NKU / OneVAE
☆48Updated this week
Sonettoo / CRS-Diff
☆80Updated 6 months ago
congvvc / InstructSeg
[ICCV 2025] Official implementation of "InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models"
☆46Updated 6 months ago
opendatalab / LEGION
The official implementation of the paper "LEGION: Learning to Ground and Explain for Synthetic Image Detection"
☆53Updated 2 months ago
liuting20 / SwimVG
Transactions on Multimedia (TMM25)
☆16Updated 4 months ago
HVision-NKU / GlimpsePrune
Official repository of the paper "A Glimpse to Compress: Dynamic Visual Token Pruning for Large Vision-Language Models"
☆51Updated last week
zhiyuanyou / DeQA-Score
[CVPR 2025] Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution
☆173Updated 5 months ago
TianheWu / VisualQuality-R1
VisualQuality-R1 is the first open-sourced NR-IQA model can accurately describe and rate the image quality.
☆86Updated this week
CodeGoat24 / UniGenBench
UniGenBench: A Unified T2I Generation Benchmark
☆38Updated this week
SuleBai / SC-CLIP
Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation
☆54Updated 3 months ago
opendatalab / UrBench
[AAAI 2025]This repo contains evaluation code for the paper “UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in…
☆34Updated 4 months ago
AMAP-ML / VMBench
[ICCV 25] VMBench: A Benchmark for Perception-Aligned Video Motion Generation
☆58Updated 3 weeks ago
songw-zju / PixelThink
The official implementation of "PixelThink: Towards Efficient Chain-of-Pixel Reasoning" (arXiv 2025)
☆36Updated 3 months ago
zhengxuJosh / SAM4SS
SAM4SS: Tailoring SAM and SAM2 for Semantic Segmentation
☆10Updated last year
Kwai-Klear / AR-GRPO
Training Autoregressive Image Generation models via Reinforcement Learning
☆27Updated 2 weeks ago
opendatalab / skydiffusion
[ICCV 2025] The official implementation of the paper “Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm”
☆68Updated last month
DCDmllm / AnyEdit
【CVPR 2025 Oral】Official Repo for Paper "AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea"
☆179Updated 4 months ago
lisat-bair / LISAt_code
☆15Updated 3 months ago
codefanw / FlashSloth
[CVPR2025] FlashSloth: Lightning Multimodal Large Language Models via Embedded Visual Compression
☆49Updated 6 months ago
gudaochangsheng / MaskUnet
[CVPR 2025] Official PyTorch implementation of Not All Parameters Matter: Masking Diffusion Models for Enhancing Generation Ability
☆29Updated last month
JiahuaDong / CIFC
[NeurIPS2024]
☆28Updated 8 months ago
wusize / OpenUni
☆149Updated 2 months ago