TonyLianLong/igligen

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/TonyLianLong/igligen)

TonyLianLong / igligen

Improved Implementation for Training GLIGEN: Open-Set Grounded Text-to-Image Generation

☆46

Alternatives and similar repositories for igligen

Users that are interested in igligen are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

TonyLianLong / LLM-groundedVideoDiffusion
View on GitHub
[ICLR 2024] LLM-grounded Video Diffusion Models (LVD): official implementation for the LVD paper
☆172May 7, 2024Updated 2 years ago
microsoft / VISOR
View on GitHub
☆46Oct 27, 2023Updated 2 years ago
TonyLianLong / LLM-groundedDiffusion
View on GitHub
LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusi…
☆483Sep 9, 2024Updated last year
Cominclip / BoxDiff-XL
View on GitHub
Extend BoxDiff to SDXL (SDXL-based layout-to-image generation)
☆28May 23, 2024Updated 2 years ago
ryugo417 / TKG-DM
View on GitHub
☆15Apr 9, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
dynamic-lm / interrupt-lrm
View on GitHub
🔥 [ICML 2026] Official implementation of "Are LRMs Interruptible?"
☆18Jun 18, 2026Updated last month
TonyLianLong / CrossMAE
View on GitHub
Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders
☆135Apr 10, 2025Updated last year
para-lost / ECHO
View on GitHub
Echo: "Constantly Improving Image Models Need Constantly Improving Benchmarks" (ICLR 2026)
☆20Jan 29, 2026Updated 5 months ago
visual-haystacks / mirage
View on GitHub
🔥 [ICLR 2025] Official PyTorch Model "Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark"
☆27Feb 9, 2025Updated last year
gligen / GLIGEN
View on GitHub
Open-Set Grounded Text-to-Image Generation
☆2,226Mar 6, 2024Updated 2 years ago
eslambakr / HRS_benchmark
View on GitHub
☆60Oct 13, 2023Updated 2 years ago
showlab / VisorGPT
View on GitHub
[NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT
☆138May 4, 2024Updated 2 years ago
zhenyuw16 / CompAgent_code
View on GitHub
Code release for our paper "Divide and Conquer: Language Models can Plan and Self-Correct for Compositional Text-to-Image Generation".
☆18Jan 30, 2024Updated 2 years ago
Junyong-Jung / PSAD
View on GitHub
A Comprehensive Real-World Photometric Stereo Dataset for Unsupervised Anomaly Detection
☆14Oct 20, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
showlab / BoxDiff
View on GitHub
[ICCV 2023] BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion
☆275Nov 12, 2024Updated last year
RyannDaGreat / rp
View on GitHub
This is a python library. Install with "python3 -m pip install rp" then run with "python3 -m rp" or just "rp". Requires python≥3.5
☆13Jul 13, 2026Updated last week
microsoft / ReCo
View on GitHub
ReCo: Region-Controlled Text-to-Image Generation, CVPR 2023
☆135Nov 8, 2023Updated 2 years ago
LinglingCai0314 / FreeMask
View on GitHub
☆11Jan 18, 2025Updated last year
KAIST-Visual-AI-Group / StochSync
View on GitHub
Official implementation of StochSync: a zero-shot approach for image generation in arbitrary spaces via stochastic diffusion synchronizat…
☆21Jun 24, 2025Updated last year
jokersio-tsy / CroSel
View on GitHub
[CVPR 24] This is official implication for our paper: ''CroSel: Cross Selection of Confident Pseudo Labels for Partial-Label Learning''.
☆15Apr 27, 2025Updated last year
DongGeun-Yoon / Stereo-Magnification-Learning-view-synthesis-using-multiplane-images-MPIs-
View on GitHub
Reproduce Pytorch implementationof MPIs
☆12Aug 24, 2023Updated 2 years ago
tobran / StoryImager
View on GitHub
[ECCV2024] StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion
☆40Jul 5, 2024Updated 2 years ago
tulip-berkeley / open_clip
View on GitHub
An open source implementation of CLIP (With TULIP Support)
☆165May 14, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
tsunghan-wu / reverse_vlm
View on GitHub
🔥 [NeurIPS 2025] Official implementation of "Generate, but Verify: Reducing Visual Hallucination in Vision-Language Models with Retrospe…
☆58Jan 22, 2026Updated 6 months ago
DongGeun-Yoon / DCP
View on GitHub
Official PyTorch implementation of "Lightweight Alpha Matting Network Using Distillation-Based Channel Pruning" (Asian Conference on Comp…
☆13Nov 5, 2022Updated 3 years ago
peter0749 / WGAN-GP-Anime-with-Auxiliary-Classifier
View on GitHub
☆11Nov 10, 2018Updated 7 years ago
ZJU4HealthCare / OmniCT
View on GitHub
【ICLR 2026】 Official Repo for Paper ‘’OmniCT: Towards a Unified Slice-Volume LVLM for Comprehensive CT Analysis‘’
☆18Mar 4, 2026Updated 4 months ago
hohonu-vicml / TrailBlazer
View on GitHub
[SIGGRAPH Asia 2024] TrailBlazer: Trajectory Control for Diffusion-Based Video Generation
☆102May 31, 2024Updated 2 years ago
Chenyu-Wang567 / All-Angles-Bench
View on GitHub
Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs
☆69Mar 22, 2026Updated 3 months ago
visgym / VisGym
View on GitHub
Official Repository of VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents
☆114May 3, 2026Updated 2 months ago
Wwangb / AdvT-shirt-1K
View on GitHub
AdvT-shirt-1K A Physical-world Adversarial T-shirt Dataset for Adversarial Robustness Evaluation
☆14Aug 7, 2025Updated 11 months ago
anonymous-sushi-armadillo / fast_is_better_than_free_imagenet
View on GitHub
☆10Sep 25, 2019Updated 6 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Karine-Huang / T2I-CompBench
View on GitHub
[Neurips 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluation
☆346May 7, 2026Updated 2 months ago
j-min / VPGen
View on GitHub
Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)
☆57Jul 25, 2023Updated 2 years ago
DCDmllm / HyperLLaVA
View on GitHub
Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models
☆28Mar 22, 2024Updated 2 years ago
LeyRio / MIG_Bench
View on GitHub
The MIG benchmark of CVPR2024 MIGC
☆15Mar 3, 2024Updated 2 years ago
apple / ml-space-benchmark
View on GitHub
Code and data for "Does Spatial Cognition Emerge in Frontier Models?"
☆29Apr 18, 2025Updated last year
zweiein / End_to_end_Speech_Papers
View on GitHub
☆13Sep 12, 2017Updated 8 years ago
yuanzhongqiao / Industrial-Defect-Diffusion-Model
View on GitHub
Industrial Defect Diffusion Model (NOT JUST INDUSTRIAL DEFECT~), support DDPM, DDIM and multi-GPU distributed training. 分布式训练，生成模型，扩散模型
☆17Nov 10, 2023Updated 2 years ago