stefantaubert / mean-opinion-scoreView external linksLinks
Python library for calculating the mean opinion score and 95% confidence interval of the standard deviation of text-to-speech ratings according to Ribeiro et al. (2011).
☆24Jan 31, 2025Updated last year
Alternatives and similar repositories for mean-opinion-score
Users that are interested in mean-opinion-score are comparing it to the libraries listed below
Sorting:
- Run Retrieval-based Voice Conversion training and inference with ease.☆11Jan 24, 2025Updated last year
- VI-SVC model is just VITS without MAS and DurationPredictor.☆10Nov 9, 2023Updated 2 years ago
- ☆14Aug 1, 2025Updated 6 months ago
- 4G GPU & 10 Minutes for train☆12Aug 9, 2023Updated 2 years ago
- ☆14Aug 19, 2024Updated last year
- Zalo Text-To-Speech for python☆11May 10, 2021Updated 4 years ago
- A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based …☆64Aug 24, 2025Updated 5 months ago
- PyTorch implementation of simplified neural source filter model (s-nsf)☆14Aug 4, 2021Updated 4 years ago
- Multilingual-Speech-Synthesis-Voice-Conversion Using Bark + RVC☆14Apr 19, 2025Updated 9 months ago
- ☆16Dec 12, 2023Updated 2 years ago
- ☆19May 2, 2024Updated last year
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆21Sep 18, 2023Updated 2 years ago
- ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations☆21Sep 21, 2025Updated 4 months ago
- Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",…☆80May 29, 2023Updated 2 years ago
- Sovits5 with RMVPE☆14Jul 17, 2023Updated 2 years ago
- Implementation of our paper 'On Metric Learning For Audio-Text Cross-Modal Retrieval'☆50May 17, 2022Updated 3 years ago
- Mel cepstral distortion (MCD) computations in python. Use Merlin toolkit to convert .wav files to .gcm files. Work in all form of .wav fi…☆21Sep 4, 2020Updated 5 years ago
- Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX☆21Oct 8, 2024Updated last year
- An AR+AR TTS attempt.☆18Jan 13, 2025Updated last year
- Official implementation of the paper "SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transfor…☆24Feb 17, 2023Updated 2 years ago
- Few-shot multilingual tts with RVC and Vits☆52Jun 15, 2023Updated 2 years ago
- Official Code for ParrotTTS☆58Oct 13, 2024Updated last year
- Code for CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning☆23Jul 12, 2022Updated 3 years ago
- Prosody and Pronunciation Modification Network☆62May 5, 2025Updated 9 months ago
- SylNet: An Adaptable End-to-End Syllable Count Estimator for Speech☆27May 25, 2023Updated 2 years ago
- Based on https://github.com/fatchord/WaveRNN☆24May 3, 2020Updated 5 years ago
- Pytorch implementation of CS-Tacotron, a code-switching speech synthesis end-to-end generative TTS model.☆23Mar 14, 2019Updated 6 years ago
- Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models☆175Dec 18, 2023Updated 2 years ago
- Vecna is a Python chatbot which recommends songs and movies depending upon your feelings☆11Jun 28, 2022Updated 3 years ago
- Speech Human Evaluation Estimation Toolkit (SHEET)☆132Oct 2, 2025Updated 4 months ago
- The deme page of InstructTTS☆157Feb 10, 2024Updated 2 years ago
- Deep Noise Suppression for Real Time Speech Enhancement in a Single Channel Wide Band Scenario☆27Jan 25, 2024Updated 2 years ago
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆27Apr 23, 2024Updated last year
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆34Aug 27, 2023Updated 2 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆36May 1, 2024Updated last year
- A flexible sentence segmentation library using CRF model and regex rules☆31Oct 5, 2025Updated 4 months ago
- DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors☆35Feb 11, 2025Updated last year
- Diffusion-based singing voice pitch correction☆135Sep 20, 2024Updated last year
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆34Jan 23, 2024Updated 2 years ago