lucidrains/nuwa-pytorch

Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch

☆548

Alternatives and similar repositories for nuwa-pytorch

Users who are interested in nuwa-pytorch are comparing it to the repositories listed below.

microsoft / NUWA
A unified 3D Transformer Pipeline for visual synthesis
☆2,794 · May 29, 2023 · Updated 3 years ago
lucidrains / x-clip
A concise but complete implementation of CLIP with various experimental improvements from recent papers
☆724 · Oct 16, 2023 · Updated 2 years ago
lucidrains / video-diffusion-pytorch
Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch
☆1,384 · May 3, 2024 · Updated 2 years ago
Jack000 / DALLE-pytorch
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
☆89 · Dec 3, 2021 · Updated 4 years ago
microsoft / VQ-Diffusion
Official implementation of VQ-Diffusion
☆981 · Apr 17, 2024 · Updated 2 years ago
lucidrains / make-a-video-pytorch
Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch
☆1,986 · May 3, 2024 · Updated 2 years ago
lucidrains / DALLE-pytorch
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
☆5,629 · Feb 17, 2024 · Updated 2 years ago
lucidrains / flexible-diffusion-modeling-videos-pytorch
Implementation of the video diffusion model and training scheme presented in the paper, Flexible Diffusion Modeling of Long Videos, in Py…
☆85 · May 28, 2022 · Updated 4 years ago
lucidrains / rela-transformer
Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012
☆49 · Apr 6, 2022 · Updated 4 years ago
crowsonkb / v-diffusion-pytorch
v objective diffusion inference code for PyTorch.
☆719 · Nov 29, 2022 · Updated 3 years ago
kakaobrain / mindall-e
PyTorch implementation of a 1.3B text-to-image generation model trained on 14 million image-text pairs
☆631 · Aug 9, 2022 · Updated 3 years ago
AranKomat / Diff-DALLE
☆65 · Nov 4, 2021 · Updated 4 years ago
openai / glide-text2im
GLIDE: a diffusion-based text-conditional image synthesis model
☆3,686 · Mar 8, 2024 · Updated 2 years ago
afiaka87 / pyglide
A CLI tool for using GLIDE to generate images from text.
☆66 · May 5, 2022 · Updated 4 years ago
lucidrains / phenaki-pytorch
Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch
☆790 · Jul 29, 2024 · Updated last year
afiaka87 / glide-finetune
Finetune glide-text2im from openai on your own data.
☆88 · Feb 28, 2026 · Updated 4 months ago
tgisaturday / dalle-lightning
Refactoring dalle-pytorch and taming-transformers for TPU VM
☆60 · Aug 30, 2021 · Updated 4 years ago
lucidrains / imagen-pytorch
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
☆8,418 · Oct 7, 2024 · Updated last year
lucidrains / parti-pytorch
Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch
☆538 · Dec 8, 2023 · Updated 2 years ago
CasualGANPapers / Make-A-Scene
Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors
☆335 · Aug 9, 2022 · Updated 3 years ago
lucidrains / retrieval-augmented-ddpm
Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch
☆66 · May 5, 2022 · Updated 4 years ago
Jack000 / guided-diffusion
☆28 · Dec 16, 2021 · Updated 4 years ago
autonomousvision / stylegan-xl
[SIGGRAPH'22] StyleGAN-XL: Scaling StyleGAN to Large Diverse Datasets
☆994 · Jun 24, 2024 · Updated 2 years ago
snap-research / MMVID
[CVPR 2022] Show Me What and Tell Me How: Video Synthesis via Multimodal Conditioning
☆191 · Jun 16, 2022 · Updated 4 years ago
gnobitab / FuseDream
☆194 · Dec 7, 2021 · Updated 4 years ago
zai-org / CogView
Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView: Mastering Text-to-Image Generation via Transformers".
☆1,797 · Sep 25, 2023 · Updated 2 years ago
kingoflolz / CLIP_JAX
Contrastive Language-Image Pretraining
☆147 · Sep 6, 2022 · Updated 3 years ago
EleutherAI / vqgan-clip
☆355 · May 10, 2022 · Updated 4 years ago
self-distilled-stylegan / self-distilled-internet-photos
☆242 · Apr 5, 2022 · Updated 4 years ago
CompVis / imagebart
ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis
☆126 · Mar 14, 2022 · Updated 4 years ago
lucidrains / n-grammer-pytorch
Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch
☆81 · Dec 4, 2022 · Updated 3 years ago
ai-forever / ru-dolph
RUDOLPH: One Hyper-Tasking Transformer can be creative as DALL-E and GPT-3 and smart as CLIP
☆254 · Feb 6, 2023 · Updated 3 years ago
pbaylies / clustering-laion400m
Script and models for clustering LAION-400m CLIP embeddings.
☆26 · Jan 10, 2022 · Updated 4 years ago
eps696 / aphantasia
CLIP + FFT/DWT/RGB = text to image/video
☆790 · Feb 13, 2025 · Updated last year
lucidrains / panoptic-transformer
Another attempt at a long-context / efficient transformer by me
☆38 · Apr 11, 2022 · Updated 4 years ago
SHI-Labs / Versatile-Diffusion
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023
☆1,334 · Aug 10, 2023 · Updated 2 years ago
afiaka87 / clip-guided-diffusion
A CLI tool/python module for generating images from text using guided diffusion and CLIP from OpenAI.
☆459 · Dec 31, 2025 · Updated 6 months ago
NightmareAI / majesty-diffusion
Majesty Diffusion by @Dango233 and @apolinario (@multimodalart)
☆25 · Jul 26, 2022 · Updated 3 years ago
LAION-AI / dalle2-laion
Pretrained Dalle2 from laion
☆505 · Apr 15, 2023 · Updated 3 years ago