bene-ges / nemo_compatibleLinks
useful things that work with NVIDIA NeMo library
☆13Updated last year
Alternatives and similar repositories for nemo_compatible
Users that are interested in nemo_compatible are comparing it to the libraries listed below
Sorting:
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆61Updated 4 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆44Updated 4 years ago
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆62Updated last year
- ☆70Updated last year
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆23Updated 8 months ago
- ☆10Updated 3 years ago
- [INTERSPEECH 2025] Official code for "SEED: Speaker Embedding Enhancement Diffusion Model"☆52Updated this week
- Template for creating audio encoders compatible with X-ARES☆11Updated 9 months ago
- ☆19Updated last year
- The VoxTube dataset official repository☆70Updated last year
- Objective metrics used in several text-to-speech (TTS) papers.☆50Updated 4 months ago
- ☆37Updated 4 years ago
- Official repository for the WenetSpeech-Chuan dataset.☆67Updated 2 weeks ago
- An automatic prosodic boundary annotation tool for Text-to-Speech Synthesis (TTS).☆51Updated last year
- Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995☆79Updated 11 months ago
- CDER (Conversational Diarization Error Rate) Scoring Tool☆22Updated 3 years ago
- ☆70Updated 4 months ago
- ☆11Updated 2 years ago
- ☆63Updated last year
- ☆12Updated last month
- ☆81Updated 9 months ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆40Updated 2 years ago
- ☆68Updated 2 years ago
- PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis☆41Updated 3 years ago
- MOS score prediction by fine-tuned wav2vec2.0 model☆170Updated 3 years ago
- Official repository for Mamba-based Segmentation Model for Speaker Diarization☆43Updated 5 months ago
- Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.☆55Updated 2 months ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆36Updated last year
- ☆18Updated last year
- Speech samples and code of BEdit-TTS☆34Updated 2 years ago