mlcommons / peoples-speech
The People’s Speech Dataset
☆104 Updated last year
Alternatives and similar repositories for peoples-speech
Users who are interested in peoples-speech are comparing it to the libraries listed below.
- ☆68 Updated 3 years ago
- A torch implementation of a recursion which turns out to be useful for RNN-T. ☆142 Updated last year
- Segment a given audio into utterances using a trained end-to-end ASR model. ☆73 Updated 4 years ago
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020) ☆103 Updated 2 years ago
- Implementation of audio degradation processes ☆103 Updated 9 years ago
- Multistream CNN for Robust Acoustic Modeling ☆40 Updated 4 years ago
- Example code for a neural transducer model. ☆64 Updated last year
- ☆37 Updated 2 months ago
- Pytorch implementation of Generalized End-to-End Loss for speaker verification ☆85 Updated 6 years ago
- The codebase for Data-driven general-purpose voice activity detection. ☆94 Updated last year
- Memory efficient transducer loss computation ☆68 Updated 3 years ago
- Recurrent Neural Aligner ☆50 Updated 5 years ago
- ☆76 Updated 3 years ago
- Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations ☆38 Updated 2 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech ☆51 Updated 3 years ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code. ☆67 Updated 2 months ago
- Various speech datasets made available to the public ☆123 Updated 7 months ago
- Alignment files of LibriTTS. ☆64 Updated 5 years ago
- A lightweight library to compute Diarization Error Rate (DER). ☆59 Updated last year
- Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena. ☆44 Updated 2 years ago
- Discriminative Neural Clustering for Speaker Diarisation ☆78 Updated 3 years ago
- A pytorch wrapper for LF-MMI training and parallel training in Kaldi ☆73 Updated 3 years ago
- E2E-SincNet: Toward fully end-to-end speech recognition ☆30 Updated 5 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts ☆60 Updated 2 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding ☆75 Updated 3 years ago
- Python wrapper for kaldi's arpa2fst ☆38 Updated 7 months ago
- An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai… ☆54 Updated 5 years ago
- A PyTorch implementation of the universal neural vocoder ☆67 Updated 4 years ago
- Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020) ☆141 Updated 2 years ago
- Small compression utility ☆37 Updated 3 months ago