PicoTrex / Awesome-GPT-Image-generationLinks
☆12Updated 4 months ago
Alternatives and similar repositories for Awesome-GPT-Image-generation
Users that are interested in Awesome-GPT-Image-generation are comparing it to the libraries listed below
Sorting:
- Official Repository of paper: Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing☆87Updated this week
- [ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning☆33Updated 3 weeks ago
- ☆11Updated 8 months ago
- EARL: Editing with Autoregression and RL☆23Updated 3 weeks ago
- ☆32Updated 8 months ago
- code for FineLIP☆28Updated 5 months ago
- CAR: Controllable AutoRegressive Modeling for Visual Generation☆122Updated 9 months ago
- Official code for "DiffX: Guide Your Layout to Cross-Modal Generative Modeling"☆23Updated 6 months ago
- ☆48Updated this week
- ☆80Updated 6 months ago
- [ICCV 2025] Official implementation of "InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models"☆46Updated 6 months ago
- The official implementation of the paper "LEGION: Learning to Ground and Explain for Synthetic Image Detection"☆53Updated 2 months ago
- Transactions on Multimedia (TMM25)☆16Updated 4 months ago
- Official repository of the paper "A Glimpse to Compress: Dynamic Visual Token Pruning for Large Vision-Language Models"☆51Updated last week
- [CVPR 2025] Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution☆173Updated 5 months ago
- VisualQuality-R1 is the first open-sourced NR-IQA model can accurately describe and rate the image quality.☆86Updated this week
- UniGenBench: A Unified T2I Generation Benchmark☆38Updated this week
- Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation☆54Updated 3 months ago
- [AAAI 2025]This repo contains evaluation code for the paper “UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in…☆34Updated 4 months ago
- [ICCV 25] VMBench: A Benchmark for Perception-Aligned Video Motion Generation☆58Updated 3 weeks ago
- The official implementation of "PixelThink: Towards Efficient Chain-of-Pixel Reasoning" (arXiv 2025)☆36Updated 3 months ago
- SAM4SS: Tailoring SAM and SAM2 for Semantic Segmentation☆10Updated last year
- Training Autoregressive Image Generation models via Reinforcement Learning☆27Updated 2 weeks ago
- [ICCV 2025] The official implementation of the paper “Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm”☆68Updated last month
- 【CVPR 2025 Oral】Official Repo for Paper "AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea"☆179Updated 4 months ago
- ☆15Updated 3 months ago
- [CVPR2025] FlashSloth: Lightning Multimodal Large Language Models via Embedded Visual Compression☆49Updated 6 months ago
- [CVPR 2025] Official PyTorch implementation of Not All Parameters Matter: Masking Diffusion Models for Enhancing Generation Ability☆29Updated last month
- [NeurIPS2024]☆28Updated 8 months ago
- ☆149Updated 2 months ago