bene-ges / nemo_compatibleLinks
useful things that work with NVIDIA NeMo library
☆14Updated last year
Alternatives and similar repositories for nemo_compatible
Users that are interested in nemo_compatible are comparing it to the libraries listed below
Sorting:
- ☆11Updated 2 years ago
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆61Updated 4 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆44Updated 4 years ago
- Template for creating audio encoders compatible with X-ARES☆17Updated last month
- Official implementation of DNSMOS Pro (accepted at INTERSPEECH 2024).☆74Updated 7 months ago
- Objective metrics used in several text-to-speech (TTS) papers.☆52Updated 7 months ago
- [ASRU 2023] Code of paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation☆21Updated last year
- ☆49Updated 5 years ago
- MOS score prediction by fine-tuned wav2vec2.0 model☆174Updated 3 years ago
- Alignment files of LibriTTS.☆66Updated 5 years ago
- PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.☆73Updated 4 years ago
- Implementation of the AlignTTS☆77Updated 2 years ago
- Official repository for Mamba-based Segmentation Model for Speaker Diarization☆46Updated 8 months ago
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…☆74Updated 3 years ago
- A toolkit for any-to-any encoder-decoder voice conversion systems☆84Updated 2 years ago
- CDER (Conversational Diarization Error Rate) Scoring Tool☆22Updated 3 years ago
- ☆26Updated last year
- ☆77Updated 6 months ago
- Tacotron2 with Global Style Tokens☆65Updated 6 years ago
- ☆69Updated 4 years ago
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.☆53Updated last year
- ☆32Updated last year
- The project is associated with the recently-launched INTERSPEECH 2025 Workshop on Multilingual Conversational Speech Language Model (MLC-…☆48Updated 8 months ago
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS☆33Updated 2 years ago
- A CSRankings-like index for speech researchers☆35Updated last year
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction☆46Updated last year
- A list of papers for child ASR☆50Updated last year
- Official repository for the WenetSpeech-Chuan dataset.☆134Updated last month
- PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis☆41Updated 3 years ago
- FNSE-SBGAN: Far-field Speech Enhancement with Schrödinger Bridge and Generative Adversarial Networks☆16Updated 8 months ago