alumae / torch-xvectors-wav
☆22Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for torch-xvectors-wav
- End-to-end diarization loss☆22Updated 3 years ago
- Prosodic Speech Segmentation with Transformers☆23Updated 8 months ago
- A handy dataset of noises for ASR☆19Updated 5 years ago
- ☆17Updated last year
- Multipurpose Multi Speaker Mixture Signal Generator☆43Updated last month
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Updated 2 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆12Updated 3 years ago
- This repository contains the Kaldi LF-MMI implementation of the paper "Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for…☆9Updated 2 years ago
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024)☆12Updated 2 months ago
- Open Source Speech/Text Data on AI☆18Updated 2 years ago
- MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.☆23Updated 3 years ago
- ☆12Updated 3 years ago
- Pronunciation-assisted Subword Modeling☆29Updated 5 years ago
- A collection of papers related to speech model compression☆24Updated last year
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆18Updated 8 months ago
- ☆33Updated 2 years ago
- ☆16Updated 2 years ago
- Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices☆16Updated 8 months ago
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models☆35Updated last month
- CML-TTS: A Multilingual Dataset for Speech Synthesis☆29Updated 3 months ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Updated last year
- ☆10Updated last year
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆13Updated last month
- Filtering and Noise Adding Tool☆29Updated 2 years ago
- ☆26Updated 3 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- 60k hours of phoneme-aligned audio from audio books☆18Updated 3 months ago
- A library of speech gadgets.☆13Updated 2 years ago