VQCPC-GAN: Variable-length Adversarial Audio Synthesis using Vector-Quantized Contrastive Predictive Coding
★14 · Apr 27, 2021 · Updated 5 years ago
Alternatives and similar repositories for vqcpc-gan
Users that are interested in vqcpc-gan are comparing it to the libraries listed below.
- Make musical loops in the browser using WaveGAN, GANSynth, and MusicVAE · ★35 · Nov 17, 2022 · Updated 3 years ago
- 🎼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition · ★14 · Nov 15, 2025 · Updated 6 months ago
- singing voice with annotations of vocal onsets, based on the matched MIDI from http://colinraffel.com/projects/lmd/ · ★20 · Dec 30, 2019 · Updated 6 years ago
- mmyun · ★17 · Aug 4, 2025 · Updated 9 months ago
- An architecture that makes any doodle realistic, in any specified style, using VQGAN, CLIP and some basic embedding arithmetics. · ★12 · Jun 18, 2025 · Updated 11 months ago
- Speech Emotion Recognition using transfer learning with wav2vec on IEMOCAP. · ★17 · Aug 8, 2021 · Updated 4 years ago
- Reimplementation of speech decoding 2022 paper by MetaAI · ★14 · Oct 17, 2023 · Updated 2 years ago
- This is the implementation of our Interspeech 2022 paper "Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv… · ★21 · Sep 18, 2023 · Updated 2 years ago
- ★21 · Mar 15, 2023 · Updated 3 years ago
- Rainbowgram with Python · ★13 · Jan 28, 2019 · Updated 7 years ago
- Repository of TräumerAI, based on PyTorch implementation of StyleGAN 2 · ★31 · Aug 1, 2021 · Updated 4 years ago
- ★24 · Mar 24, 2023 · Updated 3 years ago
- Official PyTorch implementation for "Towards Lightweight Controllable Audio Synthesis with Conditional Implicit Neural Representations". · ★21 · Dec 3, 2021 · Updated 4 years ago
- A Python framework for immersive spatial audio simulation and education. · ★12 · Aug 19, 2022 · Updated 3 years ago
- GitHub repository for Plot and Rework: Modeling Storylines for Visual Storytelling (ACL-IJCNLP2021 Findings) · ★22 · Aug 22, 2022 · Updated 3 years ago
- An implementation of Charactr, Inc's "WavThruVec: Latent speech representation as intermediate features for neural speech synthesis" · ★29 · Sep 6, 2023 · Updated 2 years ago
- This is an implementation of the audio source separation model as well as the evaluation metrics proposed in the paper "Weakly Informed A… · ★12 · Nov 26, 2019 · Updated 6 years ago
- 🎵 Partnership with AI to create Beats · ★10 · Oct 13, 2020 · Updated 5 years ago
- Simple WebGL post-processing using some pieces from stack.gl · ★14 · Nov 21, 2014 · Updated 11 years ago
- Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders · ★123 · Nov 21, 2022 · Updated 3 years ago
- Code for our paper "Towards realistic MIDI instrument synthesizers" (https://cs.stanford.edu/~rjcaste/research/realistic_midi.pdf) · ★19 · Oct 27, 2021 · Updated 4 years ago
- Framework for one-shot multispeaker system based on Deep Learning · ★19 · May 30, 2021 · Updated 4 years ago
- An improved version of the Fastsinging singing voice synthesis system. · ★21 · Nov 3, 2020 · Updated 5 years ago
- Reproduction of "Scyclone" with PyTorch · ★16 · Jan 6, 2021 · Updated 5 years ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights · ★19 · Oct 9, 2022 · Updated 3 years ago
- A pitch detection model trained to be robust against noise and reverberation environments. · ★27 · Jan 21, 2025 · Updated last year
- ★35 · Sep 7, 2022 · Updated 3 years ago
- Implements EvoNorms B0 and S0 as proposed in Evolving Normalization-Activation Layers. · ★11 · Apr 22, 2020 · Updated 6 years ago
- MAVERICS (Manually-vAlidated Vq^2a Examples fRom Image-Caption datasetS) is a suite of test-only benchmarks for visual question answering… · ★13 · Feb 18, 2023 · Updated 3 years ago
- ★10 · Mar 10, 2021 · Updated 5 years ago
- Companion repository to "Prompt Compression and Contrastive Conditioning for Controllability and Toxicity Reduction in Language Models" · ★14 · May 31, 2023 · Updated 2 years ago
- Create soft prompts for fairseq 13B dense, GPT-J-6B and GPT-Neo-2.7B for free in a Google Colab TPU instance · ★28 · Mar 1, 2023 · Updated 3 years ago
- ★42 · Jun 2, 2020 · Updated 5 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021 · ★60 · Oct 19, 2022 · Updated 3 years ago
- VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods. · ★36 · Sep 21, 2022 · Updated 3 years ago
- Unofficially implements https://arxiv.org/abs/2112.05682 to get Linear Memory Cost on Attention for PyTorch · ★12 · Jan 16, 2022 · Updated 4 years ago
- A web app for annotating Freesound loops, and the tools to analyse the dataset created. · ★20 · Jul 6, 2023 · Updated 2 years ago
- It is an extension for the Animation Nodes add-on. It has extra nodes that enhance the functionality of the Animation Nodes. · ★11 · Dec 15, 2020 · Updated 5 years ago
- Perceptually uniform colormaps with full range of lightness. · ★17 · Jun 23, 2024 · Updated last year