Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)
☆104Nov 26, 2022Updated 3 years ago
Alternatives and similar repositories for speech-representations
Users that are interested in speech-representations are comparing it to the libraries listed below
Sorting:
- ☆25Mar 12, 2022Updated 3 years ago
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.☆368Oct 12, 2021Updated 4 years ago
- PyTorch implementation of LF-MMI for End-to-end ASR☆220Jan 14, 2021Updated 5 years ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆68Jan 5, 2026Updated last month
- ☆17Nov 25, 2019Updated 6 years ago
- ☆76Mar 18, 2022Updated 3 years ago
- Yet another speech toolkit based on Kaldi and PyTorch☆173Jul 1, 2020Updated 5 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- Non-Autoregressive Predictive Coding☆51Nov 3, 2020Updated 5 years ago
- Moved to https://github.com/k2-fsa/icefall☆146Oct 13, 2022Updated 3 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆60Oct 19, 2022Updated 3 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- Recurrent Neural Aligner☆51Apr 14, 2020Updated 5 years ago
- End-to-end ASR/LM implementation with PyTorch☆594Aug 30, 2021Updated 4 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆64May 30, 2023Updated 2 years ago
- ☆19Mar 22, 2024Updated last year
- A No-Recurrence Sequence-to-Sequence Model for Speech Recognition☆379Jul 21, 2022Updated 3 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Jun 19, 2023Updated 2 years ago
- Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"☆116Dec 22, 2021Updated 4 years ago
- A library for speech data augmentation in time-domain☆683Aug 30, 2021Updated 4 years ago
- A pytorch wrapper for LF-MMI training and parallel training in Kaldi☆73Jun 8, 2022Updated 3 years ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…☆61Apr 4, 2024Updated last year
- Segment a given audio into utterances using a trained end-to-end ASR model.☆74Oct 9, 2020Updated 5 years ago
- Official code for Cotatron @ INTERSPEECH 2020☆214Jul 25, 2024Updated last year
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Dec 10, 2020Updated 5 years ago
- Implementation for paper "Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using ß-VAE"☆44Apr 10, 2023Updated 2 years ago
- Various algorithms for voice activity detection☆22Jan 31, 2017Updated 9 years ago
- ☆276Jan 15, 2021Updated 5 years ago
- Problem Agnostic Speech Encoder☆447Jul 6, 2023Updated 2 years ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit☆2,533Jun 13, 2025Updated 8 months ago
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Mar 24, 2023Updated 2 years ago
- ☆15May 8, 2021Updated 4 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 3 years ago
- Text frontend for ESPnet tts recipes☆34Jun 1, 2021Updated 4 years ago
- a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi☆344Dec 25, 2020Updated 5 years ago
- Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.☆50May 19, 2021Updated 4 years ago
- A CSRankings-like index for speech researchers☆35Oct 16, 2024Updated last year
- Implementation of Imputer: Sequence Modelling via Imputation and Dynamic Programming in PyTorch☆58May 3, 2020Updated 5 years ago