zyang-ur/idea2img

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zyang-ur/idea2img)

zyang-ur / idea2img

Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation, ECCV 2024

☆22

Alternatives and similar repositories for idea2img

Users that are interested in idea2img are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ZhangXJ199 / TinyLLaVA-Video-R1
View on GitHub
TinyLLaVA-Video-R1: Towards Smaller LMMs for Video Reasoning
☆116Dec 24, 2025Updated 7 months ago
Dinghow / UIM
View on GitHub
The official pytorch implementation of Exploring the Interactive Guidance for Unified and Effective Image Matting [TOMM 2025]
☆25Nov 24, 2025Updated 8 months ago
Relaxed-System-Lab / multi-actor-data-selection
View on GitHub
This is the repo for the paper Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining.
☆49Aug 22, 2025Updated 11 months ago
EngineeringAI-LAB / StoryBlender
View on GitHub
A Blender extension for automated, editable, and inter-shot consistent 3D storyboard production.
☆23Jul 9, 2026Updated 2 weeks ago
AsadKhan261 / before_after_image_slider_nullsafty
View on GitHub
☆17Aug 18, 2022Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
opendatalab / LEGION
View on GitHub
[ICCV25 Highlight] The official implementation of the paper "LEGION: Learning to Ground and Explain for Synthetic Image Detection"
☆82Oct 22, 2025Updated 9 months ago
opendatalab / CrossViewDiff
View on GitHub
The official implementation of the paper "CrossViewDiff: A Cross-View Diffusion Model for Satellite-to-Street View Synthesis"
☆16Sep 2, 2024Updated last year
showlab / videogui
View on GitHub
[NeurIPS 2024 D&B] VideoGUI: A Benchmark for GUI Automation from Instructional Videos
☆53Feb 22, 2026Updated 5 months ago
jiahuei / sparse-image-captioning
View on GitHub
Image captioning with weight pruning in PyTorch
☆22Jan 14, 2022Updated 4 years ago
birdortyedi / DeNIM
View on GitHub
Official Repository for Deterministic Neural Illuminant Mapping (DeNIM) published in ICCV2023 Workshops
☆23Aug 7, 2023Updated 2 years ago
findalexli / mllm-dpo
View on GitHub
[ACL 2024] Multi-modal preference alignment remedies regression of visual instruction tuning on language model
☆48Nov 10, 2024Updated last year
TerminologyHub / termhub-in-5-minutes
View on GitHub
Developer project for getting basic API integrations working in under 5 minutes
☆11May 22, 2026Updated 2 months ago
ZichenWen1 / DIJA
View on GitHub
(ICLR 2026 🔥) Code for "The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs"
☆79Feb 9, 2026Updated 5 months ago
PicoTrex / GPT-ImgEval
View on GitHub
GPT-ImgEval: Evaluating GPT-4o’s state-of-the-art image generation capabilities
☆307May 3, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
CaraJ7 / DraCo
View on GitHub
Offical Repository for Paper: DraCo: Draft as CoT for Text-to-Image Preview and Rare Concept Generation
☆17Dec 7, 2025Updated 7 months ago
ChenHsing / VIDiff
View on GitHub
☆39Dec 4, 2023Updated 2 years ago
conghui / replaycode
View on GitHub
ReplayCode — first open-source rebuild of Claude Code that actually runs. Built from decompiled source with Node.js/esbuild
☆20Apr 1, 2026Updated 3 months ago
victor-rong / video-generation-study
View on GitHub
☆16Nov 28, 2023Updated 2 years ago
OssianEriksson / autonomous-twizy
View on GitHub
ROS packages for control of an autonomous Renault Twizy at the Department of Electrical Engineering, Chalmers University of Technology, S…
☆11May 30, 2021Updated 5 years ago
liutianlin0121 / musc
View on GitHub
Implementation of "Learning Multiscale Convolutional Dictionaries for Image Reconstruction", IEEE Transaction On Computational Imaging, 2…
☆32Apr 17, 2023Updated 3 years ago
opendatalab / LOKI
View on GitHub
[ICLR 2025 Spotlight] The official implementation of the paper “LOKI：A Comprehensive Synthetic Data Detection Benchmark using Large Multi…
☆180Feb 7, 2026Updated 5 months ago
QingtangDing / DCSR
View on GitHub
Not All Patches Are Equal: Hierarchical Dataset Condensation for Single Image Super-Resolution
☆10May 7, 2024Updated 2 years ago
cvlab-columbia / paperbot
View on GitHub
PaperBot: Learning to Design Real-World Tools Using Paper
☆13Mar 15, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
tum-bgd / CVDisaster
View on GitHub
Cross-View Geolocalization and Disaster Mapping with Street-View and VHR Satellite Imagery: A Case Study of Hurricane IAN
☆19Oct 3, 2024Updated last year
abhinit21 / building-agentic-rag-with-llamaindex
View on GitHub
Learn how to build agents that can reason over their own documents
☆14May 21, 2024Updated 2 years ago
senorfy / Kinect
View on GitHub
用Kinect2.0读取图像的深度等信息，分割出手部图像。用HOG提取手部图像信息，接着用SVM进行训练。目的是为了识别手势。
☆10Jan 8, 2020Updated 6 years ago
LHL3341 / MetaLadder
View on GitHub
MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)
☆12Apr 18, 2025Updated last year
opendatalab / FakeVLM
View on GitHub
[NeurIPS 2025 🔥] FakeVLM: Advancing Synthetic Image Detection through Explainable Multimodal Models and Fine-Grained Artifact Analysis
☆157Sep 24, 2025Updated 10 months ago
divyakraman / AerialDiffusion
View on GitHub
Codebase for the paper Aerial Diffusion: Text Guided Ground-to-Aerial View Translation from a Single Image using Diffusion Models
☆13Oct 3, 2023Updated 2 years ago
redredsheep / PrismLayers
View on GitHub
PrismLayers: Open Data for High-Quality Multi-Layer Transparent Image Generative Models
☆37Jan 14, 2026Updated 6 months ago
aaronzberger / CMU_Find_Stalk
View on GitHub
Detect corn stalks for micro-sensor insertion
☆12Mar 5, 2024Updated 2 years ago
jialuli-luka / SELMA
View on GitHub
Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
☆35Mar 12, 2024Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
duchesneaumathieu / pyperlin
View on GitHub
GPU accelerated Perlin Noise in python
☆11Oct 23, 2020Updated 5 years ago
JialeWei / OneBEV
View on GitHub
OneBEV: Using One Panoramic Image for Bird’s-Eye-View Semantic Mapping
☆34Jan 10, 2025Updated last year
lisat-bair / LISAt_code
View on GitHub
☆30Sep 2, 2025Updated 10 months ago
NickyFot / HitchhikersGuide
View on GitHub
Official repository of "A Hitchhiker's Guide to Fine-Grained Face Forgery Detection Using Common Sense Reasoning" published in NeurIPS'20…
☆12Feb 7, 2025Updated last year
keshik6 / grafting
View on GitHub
[NeurIPS 2025 Oral] Official Code for Exploring Diffusion Transformer Designs via Grafting
☆74Jan 9, 2026Updated 6 months ago
ZichenWen1 / AHGFC
View on GitHub
The source code for “Homophily-Related: Adaptive Hybrid Graph Filter for Multi-View Graph Clustering”
☆11Apr 10, 2024Updated 2 years ago
Semanticity-Research / SRBIM
View on GitHub
Official Implementation of SRBIM (CVPRW 2024, Oral)
☆14Jun 13, 2024Updated 2 years ago