lucidrains / CLAP
Contrastive Language-Audio Pretraining
☆15 Updated 4 years ago
Alternatives and similar repositories for CLAP
Users who are interested in CLAP are comparing it to the libraries listed below.
- ☆32 Updated 3 years ago
- ☆16 Updated 3 years ago
- Contrastive Language-Audio Pretraining ☆88 Updated 3 years ago
- Implementation of NWT, audio-to-video generation, in Pytorch ☆92 Updated 3 years ago
- Anonymous ICLR Submission ☆14 Updated 6 years ago
- Code base for WaveTransformer: A novel architecture for automated audio captioning ☆43 Updated 4 years ago
- PyTorch implementation of the paper "NanoFlow: Scalable Normalizing Flows with Sublinear Parameter Complexity." (NeurIPS 2020) ☆66 Updated 4 years ago
- ☆10 Updated last year
- Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation ☆14 Updated 4 years ago
- ☆22 Updated 2 years ago
- Script and models for clustering LAION-400m CLIP embeddings. ☆26 Updated 3 years ago
- High performance pytorch modules ☆18 Updated 2 years ago
- Based on https://github.com/fatchord/WaveRNN ☆24 Updated 5 years ago
- A generative modelling toolkit for PyTorch. ☆70 Updated 4 years ago
- Local Attention - Flax module for Jax ☆22 Updated 4 years ago
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation ☆88 Updated last year
- PyTorch implementation of NVIDIA WaveGlow with constant memory cost. ☆36 Updated 2 years ago
- Evaluation script for VoxMovies dataset in PyTorch ☆23 Updated last year
- Temporary anonymous version ☆22 Updated last year
- Code for ICLR 2021 Paper, "Anytime Sampling for Autoregressive Models via Ordered Autoencoding" ☆26 Updated 2 years ago
- PyTorch implementations of various vector quantization methods ☆32 Updated 4 years ago
- Implementation of Insertion-deletion Denoising Diffusion Probabilistic Models ☆30 Updated 3 years ago
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012 ☆49 Updated 3 years ago
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201… ☆15 Updated 4 years ago
- Easily turn large sets of audio urls to an audio dataset. ☆21 Updated 2 years ago
- Voice swapping with VQ-VAE and diffusion models ☆67 Updated 3 years ago
- Fast and differentiable hidden Markov model in C++ ☆17 Updated 2 years ago
- Re-implementation of 'Grokking: Generalization beyond overfitting on small algorithmic datasets' ☆38 Updated 3 years ago
- Continuous Augmented Positional Embeddings (CAPE) implementation for PyTorch ☆42 Updated 2 years ago
- An implementation of simple diffusion in PyTorch (and JAX) ☆35 Updated 2 years ago