lucidrains / DALLE2-pytorchView external linksLinks
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
☆11,332May 11, 2024Updated last year
Alternatives and similar repositories for DALLE2-pytorch
Users that are interested in DALLE2-pytorch are comparing it to the libraries listed below
Sorting:
- Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch☆8,410Oct 7, 2024Updated last year
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch☆5,629Feb 17, 2024Updated last year
- 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.☆32,768Updated this week
- High-Resolution Image Synthesis with Latent Diffusion Models☆13,845Feb 29, 2024Updated last year
- A latent text-to-image diffusion model☆72,368Jun 18, 2024Updated last year
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image☆32,562Jul 23, 2024Updated last year
- GLIDE: a diffusion-based text-conditional image synthesis model☆3,686Mar 8, 2024Updated last year
- DALL·E Mini - Generate images from a text prompt☆14,811Nov 9, 2023Updated 2 years ago
- An open source implementation of CLIP.☆13,353Nov 4, 2025Updated 3 months ago
- ☆7,291Jul 2, 2024Updated last year
- PyTorch package for the discrete VAE used for DALL·E.☆10,875Jan 31, 2024Updated 2 years ago
- Pretrained Dalle2 from laion☆504Apr 15, 2023Updated 2 years ago
- Let us control diffusion models!☆33,621Feb 25, 2024Updated last year
- Taming Transformers for High-Resolution Image Synthesis☆6,429Jul 30, 2024Updated last year
- Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py…☆24,993Updated this week
- Implementation of Denoising Diffusion Probabilistic Model in Pytorch☆10,455Aug 4, 2025Updated 6 months ago
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆22,021Jan 23, 2026Updated 3 weeks ago
- LAVIS - A One-stop Library for Language-Vision Intelligence☆11,166Nov 18, 2024Updated last year
- A concise but complete implementation of CLIP with various experimental improvements from recent papers☆722Oct 16, 2023Updated 2 years ago
- ☆3,046Feb 27, 2023Updated 2 years ago
- A collection of resources and papers on Diffusion Models☆12,273Aug 1, 2024Updated last year
- Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.☆8,803Dec 10, 2023Updated 2 years ago
- The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…☆36,351Updated this week
- Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion☆7,751Dec 8, 2022Updated 3 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆32,143Sep 30, 2025Updated 4 months ago
- The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoi…☆53,411Sep 18, 2024Updated last year
- Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.☆4,358Oct 19, 2025Updated 3 months ago
- A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training☆23,494Aug 15, 2024Updated last year
- 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal model…☆156,173Feb 7, 2026Updated last week
- Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"☆8,336May 31, 2024Updated last year
- Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch☆1,987May 3, 2024Updated last year
- ☆3,438May 14, 2024Updated last year
- ☆7,846Apr 14, 2024Updated last year
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆41,578Feb 7, 2026Updated last week
- Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.☆30,823Feb 4, 2026Updated last week
- Instant neural graphics primitives: lightning fast NeRF and more☆17,251Feb 2, 2026Updated last week
- 🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i…☆9,491Feb 6, 2026Updated last week
- Google Research☆37,233Feb 6, 2026Updated last week
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.☆24,446Aug 12, 2024Updated last year