ReadAlongs / SoundSwallowerLinks
An even smaller speech recognizer / force aligner
☆37Updated last year
Alternatives and similar repositories for SoundSwallower
Users that are interested in SoundSwallower are comparing it to the libraries listed below
Sorting:
- Labeled data for homograph disambiguation☆63Updated 2 years ago
- The EveryVoice TTS Toolkit - Text To Speech for your language☆41Updated last month
- A free & open tool for transcribing audio interviews with offline ASR support☆25Updated 2 years ago
- IPA Phonemizer/Dephonemizer for 140 human languages☆51Updated last week
- Convert Arpabet to IPA. Arpabet is the set of phonemes used by the CMU Pronouncing Dictionary. IPA is the International Phonetic Alphabet…☆44Updated 5 years ago
- SEPIA server to support open-source speech recognition via WebSocket connection.☆135Updated last year
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆46Updated 2 years ago
- Unicode Standard tokenization routines and orthography profile segmentation☆38Updated 11 months ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- Audiobook alignment for Indigenous languages☆45Updated last week
- Coqui Inference Engine☆40Updated 4 years ago
- On-device noise suppression powered by deep learning☆80Updated 2 weeks ago
- Launch your speech synthesis within one minute.☆12Updated last year
- OCTRA is a web-application for the orthographic transcription of audio files.☆39Updated this week
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Updated 2 years ago
- 📈 A forced aligner intended for synchronization of narrated text☆102Updated 5 months ago
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.☆35Updated 2 years ago
- Web app for keyword spotting using TensorflowJS☆74Updated 3 years ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆34Updated 5 years ago
- Timething is a library for aligning text transcripts with their audio recordings.☆127Updated last year
- Multilingual Grapheme to Phoneme☆50Updated 9 years ago
- On-device voice activity detection (VAD) powered by deep learning☆241Updated this week
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆186Updated last week
- TTS Client for Coqui TTS server☆13Updated 3 years ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆22Updated last year
- A curated list of awesome voice activity detection☆71Updated last year
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆106Updated 2 years ago
- 🐸TTS recipes for different datasets☆86Updated 3 years ago
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆27Updated last year
- ☆57Updated 2 years ago