hauptdigital / deepspeech-notesLinks
DeepSpeechNotes is a note taking app using Mozilla's DeepSpeech technology to transcribe speech into text notes.
☆18Updated 3 years ago
Alternatives and similar repositories for deepspeech-notes
Users that are interested in deepspeech-notes are comparing it to the libraries listed below
Sorting:
- Using YouTube to prepare a speech recognition dataset for any language☆10Updated 4 years ago
- ☆17Updated 2 years ago
- ☆11Updated 4 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Updated 4 years ago
- A handy dataset of noises for ASR☆22Updated 6 years ago
- A collection of utilities for handling IPA phones.☆26Updated 2 years ago
- ☆21Updated 7 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Updated 4 years ago
- ☆20Updated 3 years ago
- ☆17Updated 4 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆106Updated 2 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.☆22Updated 6 years ago
- Simple text to phonemes converter for multiple languages☆20Updated 3 years ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆25Updated last year
- ☆14Updated 10 years ago
- Evaluation of STT models for german language☆15Updated 4 years ago
- Artie Bias Corpus: an audio corpus + code for detecting demographic bias☆20Updated 5 years ago
- ☆22Updated 4 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Updated 3 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆36Updated last year
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated 2 years ago
- Prosodic Speech Segmentation with Transformers☆26Updated last year
- ☆37Updated 4 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Updated 3 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Updated 3 years ago
- Add n-gram and large language model (LLM) support to Whisper models.☆41Updated 9 months ago
- wake word spotting with kaldi☆19Updated 5 years ago
- ☆55Updated 3 years ago