alumae / voxlingua107_sbLinks
VoxLingua107 recipe for SpeechBrain
☆13Updated 4 years ago
Alternatives and similar repositories for voxlingua107_sb
Users that are interested in voxlingua107_sb are comparing it to the libraries listed below
Sorting:
- Online streaming speaker change detection model in Pytorch☆40Updated 2 years ago
- ☆33Updated 3 years ago
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.☆43Updated 3 years ago
- Clustering-based methods for overlapping diarization☆81Updated last year
- Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021☆40Updated 4 years ago
- An online speech recognition extension toolkit of Kaldi☆56Updated 4 years ago
- ☆54Updated last year
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆53Updated 2 months ago
- Neural network-based forced alignment with bidirectional attention mechanism☆77Updated 6 months ago
- Pronunciation-assisted Subword Modeling☆29Updated 6 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated last year
- Y-vector: Multiscale Waveform Encoder for Speaker Embedding☆24Updated last year
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated 2 years ago
- FastAudio is a Learnable Audio Frontend team Magnum's designed for the ASVspoof 2021 challenge☆46Updated 2 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆40Updated 2 years ago
- Went online decode demo☆30Updated 4 years ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆140Updated last month
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆40Updated 4 years ago
- ☆25Updated 8 months ago
- ☆56Updated 2 years ago
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆36Updated 5 years ago
- Text frontend for ESPnet tts recipes☆34Updated 4 years ago
- Streaming Audiotransformers for online Audio tagging☆45Updated last year
- ☆25Updated 11 months ago
- multilingual speech aligner☆74Updated last year
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…☆74Updated 2 years ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆48Updated 6 months ago
- Pre-trained grapheme-to-phoneme (G2P) models☆25Updated 3 years ago
- ☆25Updated 8 months ago
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆60Updated 3 years ago