MIMICLab/L-Verse

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/MIMICLab/L-Verse)

MIMICLab / L-Verse

L-Verse: Bidirectional Generation Between Image and Text

☆107

Alternatives and similar repositories for L-Verse

Users that are interested in L-Verse are comparing it to the libraries listed below

Sorting:

tgisaturday / dalle-lightning
View on GitHub
Refactoring dalle-pytorch and taming-transformers for TPU VM
☆60Aug 30, 2021Updated 4 years ago
CompVis / imagebart
View on GitHub
ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis
☆126Mar 14, 2022Updated 4 years ago
CompVis / visual-search
View on GitHub
Visual search interface
☆11Nov 30, 2021Updated 4 years ago
MIMICLab / BITTERS
View on GitHub
Large-Scale Bidirectional Training for Zero-Shot Image Captioning
☆21Feb 14, 2023Updated 3 years ago
AranKomat / Diff-DALLE
View on GitHub
☆65Nov 4, 2021Updated 4 years ago
JiwanChung / tapm
View on GitHub
☆11Dec 8, 2022Updated 3 years ago
AranKomat / Metroplex
View on GitHub
☆21Mar 15, 2023Updated 3 years ago
NightmareAI / majesty-diffusion
View on GitHub
Majesty Diffusion by @Dango233 and @apolinario (@multimodalart)
☆25Jul 26, 2022Updated 3 years ago
Jack000 / DALLE-pytorch
View on GitHub
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
☆88Dec 3, 2021Updated 4 years ago
navervision / KELIP
View on GitHub
Official PyTorch implementation of "Large-scale Bilingual Language-Image Contrastive Learning" (ICLRW 2022)
☆96Apr 13, 2022Updated 3 years ago
afiaka87 / latent-diffusion-deepspeed
View on GitHub
Finetune the 1.4B latent diffusion text2img-large checkpoint from CompVis using deepspeed. (work-in-progress)
☆36Apr 17, 2022Updated 3 years ago
drboog / Lafite
View on GitHub
Code for paper LAFITE: Towards Language-Free Training for Text-to-Image Generation (CVPR 2022)
☆183Mar 23, 2023Updated 2 years ago
pbaylies / Augmented_CLIP
View on GitHub
Training simple models to predict CLIP image embeddings from text embeddings, and vice versa.
☆60Mar 31, 2022Updated 3 years ago
j-min / DallEval
View on GitHub
DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models (ICCV 2023)
☆143Jun 10, 2025Updated 9 months ago
learning-at-home / dalle-hivemind
View on GitHub
Training a model similar to OpenAI DALL-E with volunteers from all over the Internet using hivemind and dalle-pytorch (NeurIPS 2021 demo)
☆27May 29, 2023Updated 2 years ago
mehdidc / feed_forward_vqgan_clip
View on GitHub
Feed forward VQGAN-CLIP model, where the goal is to eliminate the need for optimizing the latent space of VQGAN for each input prompt
☆140Jan 3, 2024Updated 2 years ago
YoadTew / zero-shot-image-to-text
View on GitHub
Implementation of Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic
☆278Sep 17, 2022Updated 3 years ago
jiyounglee-0523 / FourierDecoder
View on GitHub
Official repository for Fourier model that can generate periodic signals
☆10Mar 10, 2022Updated 4 years ago
JD-P / cloob-latent-diffusion
View on GitHub
CLOOB Conditioned Latent Diffusion training and inference code
☆111Apr 15, 2022Updated 3 years ago
pbaylies / clustering-laion400m
View on GitHub
Script and models for clustering LAION-400m CLIP embeddings.
☆26Jan 10, 2022Updated 4 years ago
halcy / tpuddim
View on GitHub
☆22May 3, 2022Updated 3 years ago
microsoft / VQ-Diffusion
View on GitHub
Official implementation of VQ-Diffusion
☆978Apr 17, 2024Updated last year
dzryk / cliptalk
View on GitHub
☆20Aug 19, 2021Updated 4 years ago
bes-dev / pytorch_clip_guided_loss
View on GitHub
A simple library that implements CLIP guided loss in PyTorch.
☆77Dec 25, 2021Updated 4 years ago
kakaobrain / mindall-e
View on GitHub
PyTorch implementation of a 1.3B text-to-image generation model trained on 14 million image-text pairs
☆634Aug 9, 2022Updated 3 years ago
cfoster0 / CLAP
View on GitHub
Contrastive Language-Audio Pretraining
☆87Mar 6, 2022Updated 4 years ago
easonnie / mlp-vil
View on GitHub
MLPs for Vision and Langauge Modeling (Coming Soon)
☆27Dec 9, 2021Updated 4 years ago
crowsonkb / pytorch-caffe-models
View on GitHub
The original weights of some Caffe models, ported to PyTorch.
☆11Jan 18, 2022Updated 4 years ago
ubc-vision / attribute-guided-image-generation-from-layout
View on GitHub
☆10Aug 28, 2020Updated 5 years ago
GT-RIPL / Xmodal-Ctx
View on GitHub
Official PyTorch implementation of our CVPR 2022 paper: Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for …
☆61Oct 21, 2022Updated 3 years ago
facebookresearch / OTTER
View on GitHub
This code provides a PyTorch implementation for OTTER (Optimal Transport distillation for Efficient zero-shot Recognition), as described …
☆71Dec 20, 2021Updated 4 years ago
afiaka87 / clip-guided-diffusion
View on GitHub
A CLI tool/python module for generating images from text using guided diffusion and CLIP from OpenAI.
☆460Dec 31, 2025Updated 2 months ago
EleutherAI / vqgan-clip
View on GitHub
☆354May 10, 2022Updated 3 years ago
facebookresearch / SLIP
View on GitHub
Code release for SLIP Self-supervision meets Language-Image Pre-training
☆787Feb 9, 2023Updated 3 years ago
afiaka87 / pyglide
View on GitHub
A CLI tool for using GLIDE to generate images from text.
☆67May 5, 2022Updated 3 years ago
gnobitab / FuseDream
View on GitHub
☆195Dec 7, 2021Updated 4 years ago
CuthbertCai / Ask-Confirm
View on GitHub
Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query (ICCV2021)
☆20Dec 4, 2021Updated 4 years ago
samb-t / unleashing-transformers
View on GitHub
Code for the ECCV 2022 paper "Unleashing Transformers"
☆185Apr 17, 2023Updated 2 years ago
adymaharana / VLCStoryGan
View on GitHub
Official code repository for the EMNLP 2021 paper
☆26Jan 30, 2022Updated 4 years ago