xichenpan/Kosmos-G

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xichenpan/Kosmos-G)

xichenpan / Kosmos-G

Official Implementation of ICLR'24: Kosmos-G: Generating Images in Context with Multimodal Large Language Models

☆75

Alternatives and similar repositories for Kosmos-G

Users that are interested in Kosmos-G are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

eclipse-t2i / lambda-eclipse-inference
View on GitHub
[TMLR] Official PyTorch implementation of "λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent…
☆53Nov 29, 2024Updated last year
Lyne1 / Realgeneral
View on GitHub
RealGeneral (ICCV2025)
☆17Jul 16, 2025Updated last year
Xiaojiu-z / SSR_Encoder
View on GitHub
Pytorch Implementation of "SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation"(CVPR 2024)
☆128Jul 22, 2024Updated 2 years ago
facebookresearch / metaquery
View on GitHub
Official Implementation of Paper Transfer between Modalities with MetaQueries
☆324Oct 12, 2025Updated 9 months ago
lyuPang / CrossInitialization
View on GitHub
☆40Dec 24, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Colezwhy / Layout-Your-3D
View on GitHub
[ICLR 2025] Layout-Your-3D: Controllable and Precise 3D Generation with 2D Blueprint
☆21Dec 22, 2025Updated 7 months ago
TIGER-AI-Lab / OmniEdit
View on GitHub
Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025]
☆144Jan 27, 2025Updated last year
yuangpeng / dreambench_plus
View on GitHub
[ICLR 2025] Official code implementation of DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation
☆138Feb 23, 2025Updated last year
VinAIResearch / EFHQ
View on GitHub
Code and data for the CVPR24 paper "EFHQ: Multi-purpose ExtremePose-Face-HQ dataset" [CVPR'24]
☆29Jul 23, 2024Updated 2 years ago
KwonGihyun / TweedieMix
View on GitHub
Official source codes of "TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation" (ICLR 2025)
☆62Jan 22, 2025Updated last year
RunpeiDong / DreamLLM
View on GitHub
[ICLR 2024 Spotlight] DreamLLM: Synergistic Multimodal Comprehension and Creation
☆462Dec 2, 2024Updated last year
fusiming3 / MARS
View on GitHub
Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
☆86Jul 16, 2024Updated 2 years ago
GAIR-NLP / anole
View on GitHub
[Extended verision ICLR 2025 Blog Track] Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generatio…
☆842Jun 16, 2025Updated last year
HKUST-LongGroup / CoMM
View on GitHub
[CVPR 2025 Highlight] Official repository for CoMM Dataset
☆56Dec 31, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
MS-Diffusion / MS-Diffusion
View on GitHub
[ICLR 2025] Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance
☆311Jul 30, 2025Updated 11 months ago
Corleone-Huang / RealCustomProject
View on GitHub
☆19Apr 16, 2025Updated last year
cvlab-kaist / DreamMatcher
View on GitHub
Official implementation of "DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization" (…
☆174Feb 27, 2024Updated 2 years ago
Monalissaa / DisenDiff
View on GitHub
[CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization
☆111Apr 10, 2024Updated 2 years ago
UCSC-VLAA / Complex-Edit
View on GitHub
Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark
☆29Apr 22, 2025Updated last year
thu-ml / Efficient-Diffusion-Alignment
View on GitHub
Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024)
☆15Oct 29, 2024Updated last year
MrZilinXiao / AutoVER
View on GitHub
[ECCV'24] Official Implementation of Autoregressive Visual Entity Recognizer.
☆14Mar 2, 2024Updated 2 years ago
FrameX-AI / Stream-T1
View on GitHub
☆37Jun 23, 2026Updated last month
aim-uofa / FreeCustom
View on GitHub
[CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition
☆177Sep 1, 2025Updated 10 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
Qinyu-Allen-Zhao / Arinar
View on GitHub
☆43May 30, 2025Updated last year
StarsTesla / RePaint-NeRF
View on GitHub
Officially repo of RePaint-NeRF
☆13Dec 17, 2023Updated 2 years ago
baaivision / Emu
View on GitHub
Emu Series: Generative Multimodal Models from BAAI
☆1,776Jan 12, 2026Updated 6 months ago
YangLing0818 / IterComp
View on GitHub
[ICLR 2025] IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation
☆203Feb 19, 2025Updated last year
mair-lab / EARL
View on GitHub
EARL: Editing with Autoregression and RL
☆43Nov 21, 2025Updated 8 months ago
kohjingyu / gill
View on GitHub
🐟 Code and models for the NeurIPS 2023 paper "Generating Images with Multimodal Language Models".
☆470Jan 19, 2024Updated 2 years ago
JiuhaiChen / BLIP3o
View on GitHub
Official implementation of BLIP3o-Series
☆1,663Nov 29, 2025Updated 7 months ago
trungdt880 / training-free-diffusion-variable-sized
View on GitHub
Unofficial Implementation of Training-free Diffusion Model Adaptation for Variable-Sized Text-to-Image Synthesis
☆16Sep 27, 2023Updated 2 years ago
SalesforceAIResearch / DiffusionDPO
View on GitHub
Code for "Diffusion Model Alignment Using Direct Preference Optimization"
☆705Jun 2, 2026Updated last month
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
zhangxulu1996 / awesome-personalization
View on GitHub
☆24Apr 10, 2025Updated last year
showlab / Awesome-Unified-Multimodal-Models
View on GitHub
📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.
☆830Oct 10, 2025Updated 9 months ago
TencentARC / MasaCtrl
View on GitHub
[ICCV 2023] Consistent Image Synthesis and Editing
☆843Aug 19, 2024Updated last year
cambrian-mllm / cambrian-p
View on GitHub
Cambrian-P: Pose-Grounded Video Understanding
☆102Updated this week
AILab-CVC / SEED
View on GitHub
Official implementation of SEED-LLaMA (ICLR 2024).
☆642Sep 21, 2024Updated last year
bytedance / DEADiff
View on GitHub
[CVPR 2024] Official implementation of "DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations"
☆280Jul 5, 2025Updated last year
MiZhenxing / ThinkDiff
View on GitHub
ICML2025, I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models
☆191Sep 7, 2025Updated 10 months ago