Wataru-Nakata / miipher
Unofficial implementation of miipher
☆111Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for miipher
- Reference-aware automatic speech evaluation toolkit☆106Updated 8 months ago
- Easy-to-Use Speech MOS predictors☆227Updated last year
- UT-Sarulab MOS prediction system using SSL models☆183Updated 6 months ago
- LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning☆114Updated 4 months ago
- Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context☆179Updated 2 months ago
- UTokyo-SaruLab MOS Prediction System☆83Updated this week
- FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3☆166Updated 6 months ago
- A sequence-to-sequence voice conversion toolkit.☆85Updated 4 months ago
- MOS score prediction by fine-tuned wav2vec2.0 model☆146Updated 2 years ago
- An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data"☆133Updated last year
- ☆100Updated last month
- It's a repository for implementations of neural speech editing algorithms.☆191Updated 10 months ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling☆189Updated 2 years ago
- NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis☆143Updated last year
- SelfRemaster: SSL Speech Restoration☆84Updated 10 months ago
- Implementation of SoundStorm built upon SpeechTokenizer.☆103Updated last year
- [AAAI 2024] Code for CTX-vec2wav in UniCATS☆122Updated 4 months ago
- HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform☆135Updated last year
- Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"(ICLR 2024)☆136Updated last year
- Train the next generation of TTS systems.☆160Updated last month
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.☆46Updated 5 months ago
- Unofficial implementation of NVIDIA P-Flow TTS paper☆217Updated 4 months ago
- Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E☆135Updated 2 weeks ago
- The open source code for SimpleSpeech series☆108Updated last month
- Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization☆157Updated 3 months ago
- S3PRL-VC: A Voice Conversion Toolkit based on S3PRL☆96Updated 4 months ago
- ☆123Updated last month
- ☆110Updated 2 years ago
- a curated list of speech datasets (110+ datasets, 75+ easy to download)☆93Updated last year
- Unofficial SoundStream implementation of Pytorch with training code and 16kHz pretrained checkpoint☆57Updated last year