ReadAlongs / SoundSwallower
An even smaller speech recognizer / force aligner
☆32Updated 3 months ago
Alternatives and similar repositories for SoundSwallower:
Users that are interested in SoundSwallower are comparing it to the libraries listed below
- The EveryVoice TTS Toolkit - Text To Speech for your language☆25Updated this week
- A free & open tool for transcribing audio interviews with offline ASR support☆24Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆111Updated 2 years ago
- Prosodic Speech Segmentation with Transformers☆25Updated last year
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆36Updated 4 years ago
- Coqui Inference Engine☆38Updated 3 years ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated last year
- Simple Diarization model☆47Updated last year
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated 2 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated last year
- 🎹 pyannote + 🗒 notebook = pyannotebook☆26Updated last year
- ☆80Updated 10 months ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆21Updated 3 weeks ago
- OCTRA is a web-application for the orthographic transcription of audio files.☆39Updated last week
- C++ version of pyannote audio speaker diarizaiton pipeline☆20Updated last year
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆95Updated 6 months ago
- Audiobook alignment for Indigenous languages☆39Updated this week
- (WIP) A retrain of F5-TTS on permissively-licensed data☆9Updated 3 weeks ago
- ☆17Updated last year
- IPA Phonemizer/Dephonemizer for 139 human languages☆21Updated this week
- Python wrapper for phonetisaurus grapheme to phoneme tool☆12Updated 4 years ago
- Multilingual Grapheme to Phoneme☆49Updated 9 years ago
- Timething is a library for aligning text transcripts with their audio recordings.☆117Updated 4 months ago
- A curated list of awesome voice activity detection☆48Updated 4 months ago
- Unicode Standard tokenization routines and orthography profile segmentation☆35Updated last month
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆24Updated 11 months ago
- ☆25Updated 2 years ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆70Updated 7 months ago