simonsanvil/DALL-E-Explained


simonsanvil / DALL-E-Explained

Description and applications of OpenAI's paper about DALL-E (2021) and implementation of other (CLIP-guided) zero-shot text-to-image generation schemes

☆33

Alternatives and similar repositories for DALL-E-Explained

Users who are interested in DALL-E-Explained are comparing it to the libraries listed below.


camenduru / FoleyCrafter-jupyter
☆10 · Jun 28, 2024 · Updated 2 years ago
camenduru / V-Express-jupyter
☆14 · May 31, 2024 · Updated 2 years ago
YingqingHe / ScaleCrafter-ptl
☆14 · Oct 16, 2023 · Updated 2 years ago
daooshee / TE141K
Project website of TE141K.
☆17 · Mar 24, 2020 · Updated 6 years ago
jaechanjo / TIFF
Text-Guided Generation of Full-Body Image with Preserved Reference Face for Customized Animation
☆24 · Jun 24, 2024 · Updated 2 years ago
camenduru / marigold-lcm-hf
☆12 · Mar 25, 2024 · Updated 2 years ago
LARS-research / TREFE
Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022
☆13 · Nov 25, 2022 · Updated 3 years ago
ihdia / seamformer
Official repository accompanying the ICDAR 2023 paper
☆14 · Oct 3, 2023 · Updated 2 years ago
zwx8981 / PerceptualAttack_BIQA
[NeurIPS2022] Perceptual Attacks of No-Reference Image Quality Models with Human-in-the-Loop
☆13 · Apr 13, 2023 · Updated 3 years ago
camenduru / Multi-LoRA-Composition-jupyter
☆13 · Feb 28, 2024 · Updated 2 years ago
callsys / FlowText
[ICME 2023] FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation
☆13 · May 13, 2023 · Updated 3 years ago
kartikgill / taco-box
An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR
☆15 · Dec 4, 2021 · Updated 4 years ago
camenduru / AutoStudio-jupyter
☆15 · Jun 25, 2024 · Updated 2 years ago
PeterouZh / Deep_Generative_Models
A collection of papers I am interested in.
☆29 · Apr 3, 2023 · Updated 3 years ago
thanhnghiadk / syntactic_HME_generation
This project aims to generate syntactic handwritten mathematical expressions. The dataset is generated from the CROHME 2014 training set.
☆14 · Feb 24, 2022 · Updated 4 years ago
idansc / discriminative_class_tokens
☆41 · Mar 27, 2024 · Updated 2 years ago
DCGM / SoftCTC
This repository contains the source code for SoftCTC. The original paper can be found here: https://arxiv.org/abs/2212.02135
☆19 · Mar 7, 2023 · Updated 3 years ago
SiskonEmilia / Anime-Wifu-Dataset
A tool kit to generate a dataset of anime faces.
☆21 · Jun 8, 2019 · Updated 7 years ago
georgeretsi / Seq2Emb
Create handwritten word embeddings from a text recognition Seq2Seq system.
☆11 · Dec 1, 2022 · Updated 3 years ago
SII-sc22mc / DocFusion
A Unified Framework for Document Parsing Tasks (Including Document Layout Analysis, OCR, Formula Recognition, and Table Recognition)
☆15 · Jul 1, 2025 · Updated last year
EDM-Research / VATr-pp
☆18 · Jul 9, 2024 · Updated 2 years ago
ThunderVVV / RCLSTR
Official PyTorch implementation of `[ACMMM 2023]Relational Contrastive Learning for Scene Text Recognition`
☆17 · Sep 22, 2023 · Updated 2 years ago
dshea89 / tesseract-retraining-pipeline
Intuitive interface for fine-tuning and retraining a Tesseract OCR language model
☆10 · Jul 4, 2025 · Updated last year
reka-ai / research-eval
A benchmark to evaluate search-augmented LLMs
☆17 · Aug 28, 2025 · Updated 10 months ago
psychopa4 / MSHPFNL
A Progressive Fusion Generative Adversarial Network for Realistic and Consistent Video Super-Resolution
☆16 · Jan 22, 2022 · Updated 4 years ago
Alpha-Innovator / DocParser
☆18 · Jan 13, 2025 · Updated last year
camenduru / DiffSketcher-colab
☆16 · Dec 18, 2023 · Updated 2 years ago
yqingli123 / TDv2
Source code for TDv2 from the paper "TDv2: A Novel Tree-Structured Decoder for Offline Mathematical Expression Recognition".
☆12 · Jul 28, 2022 · Updated 3 years ago
amazon-science / textadain-robust-recognition
TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers
☆21 · Jul 26, 2022 · Updated 3 years ago
yufanchen96 / GraphDoc
Graph-based Document Structure Analysis
☆18 · Mar 26, 2025 · Updated last year
IDEA-Research / hana
Implementation and checkpoints of Imagen, Google's text-to-image synthesis neural network, in PyTorch
☆17 · Dec 22, 2022 · Updated 3 years ago
HyoKong / DreamDrone
Navigate dreamscapes with a click – your chosen point guides the drone’s flight in a thrilling visual journey.
☆48 · Sep 2, 2025 · Updated 10 months ago
camenduru / zero123plus-colab
☆16 · Jan 2, 2024 · Updated 2 years ago
MenghanXia / DisentangledColorization
A colorization framework that disentangles the color multimodality and the structural consistency via adaptively located anchors, so that…
☆115 · Jun 18, 2025 · Updated last year
buaacxf / VIPTR
☆44 · Jul 9, 2024 · Updated 2 years ago
facebookresearch / language-model-plasticity
Official code for the paper Improving Language Plasticity via Pretraining with Active Forgetting, NeurIPS 2023
☆21 · Mar 12, 2026 · Updated 4 months ago
mkshing / prompt-plus-pytorch
Implementation of P+: Extended Textual Conditioning in Text-to-Image Generation
☆49 · Mar 26, 2023 · Updated 3 years ago
BRZ911 / ViTCoT
[ACM MM 2025] ViTCoT: Video-Text Interleaved Chain-of-Thought for Boosting Video Understanding in Large Language Models
☆18 · Jul 15, 2025 · Updated last year
ai-forever / deforum-kandinsky
Kandinsky x Deforum: generating short animations
☆104 · Jan 22, 2024 · Updated 2 years ago