☆21Jun 1, 2021Updated 4 years ago
Alternatives and similar repositories for SpeechNet-codebase
Users who are interested in SpeechNet-codebase are comparing it to the libraries listed below.
- ☆15Sep 9, 2021Updated 4 years ago
- Taiwanese Speech Synthesis with Tacotron2☆25Oct 2, 2022Updated 3 years ago
- **ICASSP 2022** "Toward Degradation-Robust Voice Conversion": using speech enhancement and end-to-end denoising training to improve degrada…☆24Sep 27, 2022Updated 3 years ago
- Phonemes and durations labeling based on whisper small☆11Jul 7, 2024Updated last year
- ☆100Jul 22, 2021Updated 4 years ago
- ☆23Dec 10, 2024Updated last year
- This repository contains the files used for our Interspeech 2017 paper.☆16May 30, 2017Updated 8 years ago
- A repository comprising code for generating noisy speech data from clean data using deep learning methods☆16Jul 12, 2021Updated 4 years ago
- Unofficial PyTorch Implementation of StarGAN-ZSVC☆14Aug 5, 2021Updated 4 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Oct 8, 2023Updated 2 years ago
- An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss".☆34Apr 26, 2021Updated 4 years ago
- This is an implementation of "Generative adversarial network-based postfilter for statistical parametric speech synthesis"☆16Jun 27, 2018Updated 7 years ago
- Audio samples from ICML2019 "Almost Unsupervised Text to Speech and Automatic Speech Recognition"☆17May 14, 2019Updated 6 years ago
- Hugging Face ALBERT model and its tokenizer☆15Mar 12, 2020Updated 5 years ago
- ☆19Feb 2, 2023Updated 3 years ago
- An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Norm…☆117May 27, 2021Updated 4 years ago
- dinglingling, your program over!☆18Mar 27, 2020Updated 5 years ago
- Toolbox for easy and qualitative one-shot voice conversion☆46Dec 5, 2021Updated 4 years ago
- Implementation of "Audio Retrieval with Natural Language Queries", INTERSPEECH 2021, PyTorch☆26Aug 18, 2023Updated 2 years ago
- The official implementation of the paper "Defending Your Voice: Adversarial Attack on Voice Conversion".☆52May 15, 2024Updated last year
- AudioCodec-Hub is a Python library for encoding and decoding audio data, supporting various neural audio codec models☆25Sep 26, 2023Updated 2 years ago
- This is an unofficial implementation of Universal MelGAN according to https://arxiv.org/abs/2011.09631☆23Aug 15, 2022Updated 3 years ago
- Optimized loss based on cross-entropy (CE), like MWER (minimum WER) Loss with beam search and negative sampling strategy, Smoothed Max Po…☆24Oct 11, 2024Updated last year
- TTS Text Analyzer☆32Jul 20, 2023Updated 2 years ago
- Voice conversion model for real-time speech synthesis using PPG (Phonetic PosteriorGram) as an intermediate feature, written in PyTorch.☆29Mar 3, 2022Updated 3 years ago
- voice conversion system☆25Jun 10, 2020Updated 5 years ago
- Competition-related practice☆21Sep 18, 2019Updated 6 years ago
- ☆26Sep 22, 2022Updated 3 years ago
- Real-Time High-Fidelity Speech Synthesis without GPU☆73Jul 29, 2024Updated last year
- A PyTorch implementation of the universal neural vocoder☆67Nov 6, 2020Updated 5 years ago
- ☆69Mar 31, 2021Updated 4 years ago
- ☆38Apr 15, 2024Updated last year
- ☆11Apr 20, 2020Updated 5 years ago
- Encoder and Decoder and Attention Based Prosody Prediction☆68Jan 17, 2018Updated 8 years ago
- A pretrained model for "A Phoneme-informed Neural Network Model for Note-level Singing Transcription", ICASSP 2023☆38Sep 9, 2023Updated 2 years ago
- text-to-audio-latent-diffusion☆37Aug 25, 2023Updated 2 years ago
- Python tool for converting Chinese characters to Jyutping (Cantonese romanization).☆35Feb 26, 2024Updated 2 years ago
- A curated list of awesome adversarial reprogramming and input prompting methods for neural networks since 2022☆38Nov 30, 2023Updated 2 years ago
- ChiNese Text Normalization (CNTN) tool for Text-to-speech system☆37Apr 12, 2018Updated 7 years ago