ReadAlongs / SoundSwallowerLinks
An even smaller speech recognizer / force aligner
☆33Updated 5 months ago
Alternatives and similar repositories for SoundSwallower
Users that are interested in SoundSwallower are comparing it to the libraries listed below
Sorting:
- The EveryVoice TTS Toolkit - Text To Speech for your language☆33Updated this week
- Coqui Inference Engine☆40Updated 3 years ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated 2 years ago
- A free & open tool for transcribing audio interviews with offline ASR support☆24Updated last year
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated 2 years ago
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.☆35Updated 2 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated 2 years ago
- IPA Phonemizer/Dephonemizer for 139 human languages☆27Updated last month
- Audiobook alignment for Indigenous languages☆40Updated last week
- Web app for keyword spotting using TensorflowJS☆71Updated 2 years ago
- SEPIA server to support open-source speech recognition via WebSocket connection.☆127Updated 6 months ago
- Launch your speech synthesis within one minute.☆12Updated last year
- Flask-based web framework for visualisation and explorative listening of audio.☆53Updated 2 years ago
- Buildings block for voice-enabled applications in the browser☆37Updated last month
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- Lyra V2 (SoundStream) running in the browser☆18Updated last year
- 🎹 pyannote + 🗒 notebook = pyannotebook☆26Updated last year
- Python wrapper for phonetisaurus grapheme to phoneme tool☆12Updated 4 years ago
- Evaluation of STT models for german language☆15Updated 3 years ago
- OCTRA is a web-application for the orthographic transcription of audio files.☆39Updated last month
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- Labeled data for homograph disambiguation☆57Updated 2 years ago
- 🐍 Coqui's machine learning job scheduler☆32Updated 3 years ago
- Uses ctypes and libespeak-ng to transform test into IPA phonemes☆20Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆98Updated 7 months ago
- A fast python library for aligning similar audio snippets passed in as NumPy arrays☆47Updated this week
- A converter from Arpabet to IPA (see https://en.wikipedia.org/wiki/Arpabet)☆16Updated 7 years ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆23Updated 10 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆114Updated 2 years ago