carlfm01 / my-speech-datasetsLinks
My public domain speech index
☆13Updated 6 years ago
Alternatives and similar repositories for my-speech-datasets
Users that are interested in my-speech-datasets are comparing it to the libraries listed below
Sorting:
- BurrMill core☆22Updated 4 years ago
- ☆19Updated 3 years ago
- Simple Kaldi recipe for forced alignment☆11Updated 2 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Updated 7 years ago
- Deepspeech ASR Model for the Catalan Language☆17Updated 4 years ago
- Artie Bias Corpus: an audio corpus + code for detecting demographic bias☆20Updated 5 years ago
- Evaluation of STT models for german language☆15Updated 3 years ago
- VoxAngeles Corpus☆13Updated 4 months ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆18Updated 3 years ago
- Official source for Catalan Language Models and resources made within Aina project.☆26Updated 2 years ago
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court☆22Updated 3 years ago
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.☆22Updated 6 years ago
- Thai Grapheme to Phoneme (G2P) Wiktionary Corpus☆13Updated 3 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Updated 2 years ago
- An automatic speech recognition environment for Icelandic based on Kaldi☆14Updated 8 years ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Updated 5 years ago
- Python wrapper for phonetisaurus grapheme to phoneme tool☆12Updated 4 years ago
- IPA tokeniser☆17Updated 5 months ago
- Pronounce Arabic words☆19Updated 6 years ago
- ☆17Updated 6 years ago
- Linguistic processing for Common Voice☆58Updated last year
- ☆10Updated 4 years ago
- A tool to collect/validate audio recordings from workers on Amazon Mechanical Turk. Written in Python/Flask. (originally hosted on github…☆14Updated 3 years ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11Updated 3 years ago
- ☆17Updated 4 years ago
- Getting confidences from any end-to-end systems☆11Updated 2 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Updated 3 years ago
- Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>☆19Updated 3 years ago
- Calculates the Word Error Rate between two text files☆20Updated 3 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Updated 5 years ago