ReadAlongs / SoundSwallowerLinks
An even smaller speech recognizer / force aligner
☆36Updated 10 months ago
Alternatives and similar repositories for SoundSwallower
Users that are interested in SoundSwallower are comparing it to the libraries listed below
Sorting:
- The EveryVoice TTS Toolkit - Text To Speech for your language☆41Updated last week
- Launch your speech synthesis within one minute.☆12Updated last year
- IPA Phonemizer/Dephonemizer for 139 human languages☆42Updated 3 weeks ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated 2 years ago
- Coqui Inference Engine☆41Updated 4 years ago
- Labeled data for homograph disambiguation☆60Updated 2 years ago
- Audiobook alignment for Indigenous languages☆42Updated this week
- A free & open tool for transcribing audio interviews with offline ASR support☆25Updated last year
- SEPIA server to support open-source speech recognition via WebSocket connection.☆132Updated 11 months ago
- Web app for keyword spotting using TensorflowJS☆74Updated 2 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- Convert Arpabet to IPA. Arpabet is the set of phonemes used by the CMU Pronouncing Dictionary. IPA is the International Phonetic Alphabet…☆44Updated 5 years ago
- On-device voice activity detection (VAD) powered by deep learning☆232Updated last month
- OCTRA is a web-application for the orthographic transcription of audio files.☆39Updated this week
- 🐍 Coqui's machine learning job scheduler☆31Updated 4 years ago
- Unicode Standard tokenization routines and orthography profile segmentation☆37Updated 8 months ago
- A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.☆32Updated last year
- 📈 A forced aligner intended for synchronization of narrated text☆100Updated 2 months ago
- On-device noise suppression powered by deep learning☆76Updated 2 months ago
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆18Updated 6 months ago
- A curated list of awesome voice activity detection☆67Updated 11 months ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆34Updated 5 years ago
- Evaluation of STT models for german language☆15Updated 3 years ago
- ☆22Updated 4 years ago
- Scripts to simplify data prepping for Mozilla DeepSpeech.☆14Updated 6 years ago
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- C++17 port of Open-Unmix-PyTorch with streaming LSTM inference, ggml, quantization, and Eigen☆50Updated 7 months ago
- ☆13Updated 10 years ago
- ☆17Updated 2 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Updated 2 years ago