Generated Audio Samples by ALGAN-VC model are available in the folder
☆19Feb 25, 2022Updated 4 years ago
Alternatives and similar repositories for ALGAN-VC-Generated-Audio-Samples
Users that are interested in ALGAN-VC-Generated-Audio-Samples are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆17Mar 24, 2022Updated 4 years ago
- GAN series for voice conversion on VCC2018 dataset☆17Aug 27, 2020Updated 5 years ago
- A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.☆24Mar 29, 2021Updated 5 years ago
- A vocal source separation☆38Feb 2, 2025Updated last year
- Calculation of MCD (dB) between two speech waveforms☆57Sep 26, 2020Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- CVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch)☆59Jul 26, 2022Updated 3 years ago
- ☆42Mar 25, 2022Updated 4 years ago
- Emotional Speech Conversion using Nonparallel Data☆17Apr 10, 2019Updated 7 years ago
- Mel cepstral distortion (MCD) computations in python. Use Merlin toolkit to convert .wav files to .gcm files. Work in all form of .wav fi…☆21Sep 4, 2020Updated 5 years ago
- Implementation of Emo-StarGAN☆46Dec 19, 2023Updated 2 years ago
- Voice conversion training with 109 speakers with limited training samples☆35Dec 21, 2020Updated 5 years ago
- ☆67Apr 3, 2023Updated 3 years ago
- ☆14Apr 2, 2023Updated 3 years ago
- SoTA open-source TTS☆23Jun 17, 2025Updated 9 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Cherokee Audio data☆11Dec 24, 2023Updated 2 years ago
- Implementation for paper "Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using ß-VAE"☆44Apr 10, 2023Updated 3 years ago
- pytorch implementation of "Emotional Voice Conversion using Multitask Learning with Text-to-Speech", Accepted to ICASSP 2020☆30Jul 6, 2023Updated 2 years ago
- CycleTransGAN-EVC: A CycleGAN-based Emotional Voice Conversion Model with Transformer☆35Feb 4, 2025Updated last year
- This is the code for controllable EVC framework for seen and unseen emotion generation.☆45Nov 3, 2021Updated 4 years ago
- Orpheus TTS Server with streaming support (TTFB ~160ms)☆24Sep 21, 2025Updated 6 months ago
- A NVIDIA's Pytorch Tacotron2 adaptation with unsupervised Global Style Tokens. The model has been trained with the English read-speech LJ…☆10Sep 4, 2023Updated 2 years ago
- A pytorch implementation of StarGAN-VC2☆150Sep 11, 2020Updated 5 years ago
- ☆17Oct 16, 2018Updated 7 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Enabling interactive plotting of the visualizations from the SHAP project.☆23Jan 15, 2020Updated 6 years ago
- **ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…☆24Sep 27, 2022Updated 3 years ago
- This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…☆87Dec 31, 2022Updated 3 years ago
- A sequence-to-sequence voice conversion toolkit.☆110Mar 15, 2026Updated 3 weeks ago
- A Beamerposter template with University of Cambridge logo and colors. It is forked from Gemini.☆20Nov 27, 2024Updated last year
- Text to Speech with PyTorch (English and Mongolian)☆13May 3, 2020Updated 5 years ago
- Any-to-one voice conversion using the data augment strategy: pitch shifted and duration remained.☆33Jan 10, 2022Updated 4 years ago
- Talking head animation☆28Dec 8, 2023Updated 2 years ago
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)☆125Jun 16, 2022Updated 3 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- [INTERSPEECH'2022] Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning☆83Nov 4, 2022Updated 3 years ago
- Voice conversion using deep adversarial learning☆17Oct 29, 2021Updated 4 years ago
- used to evaluate wavenet vocoder by rmse f0, MCD, rmse ap...☆15Jan 20, 2020Updated 6 years ago
- Pytorch implementation of CS-Tacotron, a code-switching speech synthesis end-to-end generative TTS model.☆23Mar 14, 2019Updated 7 years ago
- Integration of Fastspeech Text to Mel generation and fast Vocoder Squeezewave☆20Jul 6, 2023Updated 2 years ago
- ☆18Jan 31, 2023Updated 3 years ago
- A Python package for making excalidraw figures procedurally from python.☆23Oct 16, 2023Updated 2 years ago