yk / clip_music_videoLinks
Code for making music videos using CLIP
☆174Updated 4 years ago
Alternatives and similar repositories for clip_music_video
Users that are interested in clip_music_video are comparing it to the libraries listed below
Sorting:
- ☆234Updated 2 years ago
- Repository for "Generating images from caption and vice versa via CLIP-Guided Generative Latent Space Search"☆178Updated 3 years ago
- ☆152Updated last year
- ☆351Updated 3 years ago
- Here is a collection of checkpoints for DALLE-pytorch models, from where you can keep on training or start generating images.☆146Updated 2 years ago
- Open-AI's DALL-E for large scale training in mesh-tensorflow.☆434Updated 3 years ago
- v objective diffusion inference code for JAX.☆214Updated 3 years ago
- JAX implementation of VQGAN☆93Updated 3 years ago
- Feed forward VQGAN-CLIP model, where the goal is to eliminate the need for optimizing the latent space of VQGAN for each input prompt☆137Updated last year
- code for reproducing some of the diagrams in the paper "Multimodal Neurons in Artificial Neural Networks"☆308Updated 4 years ago
- Learning to ground explanations of affect for visual art.☆316Updated 4 years ago
- Contrastive Language-Image Pretraining☆143Updated 2 years ago
- Image Synthesis + Corgis = <3☆87Updated 3 years ago
- v objective diffusion inference code for PyTorch.☆718Updated 2 years ago
- RUDOLPH: One Hyper-Tasking Transformer can be creative as DALL-E and GPT-3 and smart as CLIP☆254Updated 2 years ago
- ☆198Updated 3 years ago
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch☆89Updated 3 years ago
- Playing around with stable diffusion. Generated images are reproducible because I save the metadata and latent information. You can gener…☆207Updated 2 years ago
- Refactoring dalle-pytorch and taming-transformers for TPU VM☆60Updated 3 years ago
- A CLI tool/python module for generating images from text using guided diffusion and CLIP from OpenAI.☆462Updated 3 years ago
- Minimal standalone example of diffusion model☆159Updated 3 years ago
- Optimized library for large-scale extraction of frames and audio from video.☆204Updated last year
- ☆275Updated 3 years ago
- Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch☆550Updated 2 years ago
- Implementation of TransGanFormer, an all-attention GAN that combines the finding from the recent GanFormer and TransGan paper☆153Updated 4 years ago
- ☆57Updated 3 years ago
- A collection of Jupyter notebooks to play with NVIDIA's StyleGAN3 and OpenAI's CLIP for a text-based guided image generation.☆211Updated 3 years ago
- StyleGAN2 with adaptive discriminator augmentation (ADA) - Official TensorFlow implementation☆76Updated 4 years ago
- ☆64Updated 3 years ago
- Using CLIP and StyleGAN to generate faces from prompts.☆131Updated 3 years ago