hauptdigital/deepspeech-notes

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hauptdigital/deepspeech-notes)

hauptdigital / deepspeech-notes

DeepSpeechNotes is a note taking app using Mozilla's DeepSpeech technology to transcribe speech into text notes.

☆18

Alternatives and similar repositories for deepspeech-notes

Users that are interested in deepspeech-notes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

NTRLab / MediaSpeech
View on GitHub
☆22Jul 22, 2022Updated 4 years ago
speechio / asr-noises
View on GitHub
A handy dataset of noises for ASR
☆22May 29, 2019Updated 7 years ago
kbabilinski / deep-speech-unity
View on GitHub
A Unity implementation of DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on …
☆26Sep 22, 2022Updated 3 years ago
desh2608 / kaldi-noise-vectors
View on GitHub
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13Feb 13, 2021Updated 5 years ago
SpeechColab / PySpeechColab
View on GitHub
A library of speech gadgets.
☆15Oct 15, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
TehreemFarooqi / Preparing-a-speech-recognition-dataset-using-YouTube-videos
View on GitHub
Using YouTube to prepare a speech recognition dataset for any language
☆10Mar 30, 2021Updated 5 years ago
vadimkantorov / inferspeech
View on GitHub
PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant
☆10Aug 12, 2019Updated 6 years ago
vadimkantorov / readaudio
View on GitHub
Read audio with FFmpeg into NumPy/PyTorch via ctypes (standard library module)
☆11Aug 12, 2020Updated 5 years ago
gpu-poor / gramvaani_hindi_asr
View on GitHub
This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge
☆16Mar 26, 2022Updated 4 years ago
thorstenMueller / cTTS
View on GitHub
TTS Client for Coqui TTS server
☆13Jan 7, 2023Updated 3 years ago
Digital-Umuganda / text_normalization_tts_rw
View on GitHub
☆11Apr 24, 2024Updated 2 years ago
mikex86 / DeepSpeech-Java-Bindings
View on GitHub
Java Bindings for the C++ library DeepSpeech
☆10Jun 4, 2020Updated 6 years ago
LiveBaster / agifa
View on GitHub
"Artificial General Intelligence For All (AGIFA)" Project
☆12Feb 25, 2024Updated 2 years ago
falabrasil / ufpalign
View on GitHub
👄🇧🇷 Alinhamento fonético forçado em Português Brasileiro
☆13Jul 18, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
kaustubh-iamplus / webrtc_vad
View on GitHub
Voice activity detection (VAD) library and Go bindings based on WebRTC's VAD engine
☆11Mar 1, 2018Updated 8 years ago
sarahjuan / iban
View on GitHub
☆14Jun 12, 2015Updated 11 years ago
coqui-ai / data-checker
View on GitHub
🫠 check your data, before you wreck your model
☆16Aug 11, 2022Updated 3 years ago
patrickvonplaten / Wav2Vec2_ParlanceCTCDecode
View on GitHub
☆11Nov 5, 2021Updated 4 years ago
ccoreilly / LocalSTT
View on GitHub
Android Speech Recognition Service using Vosk/Kaldi and Mozilla DeepSpeech
☆111Jan 19, 2022Updated 4 years ago
egorsmkv / qirimtatar-tts-datasets
View on GitHub
Open Source Crimean Tatar Text-to-Speech datasets
☆14Feb 23, 2025Updated last year
asappresearch / multistream-cnn
View on GitHub
Multistream CNN for Robust Acoustic Modeling
☆40Jun 17, 2021Updated 5 years ago
isca-sig-rosp / ISCA-SIG-RoSP
View on GitHub
Web page for ISCA Special Interest Group: Robust Speech Processing (RoSP)
☆11Dec 4, 2023Updated 2 years ago
ffaisal93 / SD-QA
View on GitHub
☆16Feb 10, 2026Updated 5 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
tiefenauer / wiki-lm
View on GitHub
Script to train a German n-gram Language Model on articles of Wikipedia
☆14Oct 20, 2018Updated 7 years ago
Makerfabs / MaTouch-ESP32-S3-RotaryIPS-Display1.28-GC9A01
View on GitHub
☆11Mar 22, 2024Updated 2 years ago
7Xin / DPI-TTS
View on GitHub
☆13Sep 12, 2024Updated last year
cschaefer26 / StyleMelGAN
View on GitHub
☆10Apr 8, 2024Updated 2 years ago
erogol / ngi
View on GitHub
Fast trigram-indexed regex search for codebases — 2-6x faster than ripgrep
☆20Mar 24, 2026Updated 4 months ago
miccio-dk / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Apr 13, 2022Updated 4 years ago
one-man-studios / Shinzo-UI
View on GitHub
A robust, single standalone html file AI interface for Ollama and OpenAI. Features local RAG with vector search, real-time voice/video ca…
☆16Jul 4, 2026Updated 2 weeks ago
mweinbach / parakeet-coreml-swift
View on GitHub
Swift package for on-device speech-to-text with NVIDIA Parakeet TDT 0.6B v3 compiled to Core ML. Runs on macOS 14+ / iOS 17+, selectable …
☆20Apr 22, 2026Updated 3 months ago
techiaith / macsen
View on GitHub
Cod ar gyfer 'Macsen' - prototeip o gynorthwyydd digidol Cymraeg i'r Raspberry Pi // Code for 'Macsen' - a prototype Welsh language digit…
☆11Mar 29, 2018Updated 8 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
pilot7747 / VoxDIY
View on GitHub
This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.
☆16Jul 22, 2021Updated 5 years ago
freds0 / kabooks
View on GitHub
KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…
☆13Mar 24, 2023Updated 3 years ago
AIRI-Institute / AI4TALK
View on GitHub
☆13Dec 7, 2022Updated 3 years ago
hyperaudio / hyperaudio
View on GitHub
☆14Mar 31, 2023Updated 3 years ago
oplatek / kaldi-thesis
View on GitHub
Master thesis of Ondrej Platek: Automatic speech recognition using Kaldi. Supervised by Filip Jurcicek.
☆15Feb 20, 2020Updated 6 years ago
clulab / qup
View on GitHub
qup: a Single-Node Job Scheduler with NVIDIA GPU support
☆18Jan 10, 2023Updated 3 years ago
JRMeyer / speakerID-challenge
View on GitHub
A recipe for creating a Speaker Identification system built on Kaldi.
☆15Jan 2, 2020Updated 6 years ago