ReadAlongs / SoundSwallower
An even smaller speech recognizer / force aligner
☆32Updated 4 months ago
Alternatives and similar repositories for SoundSwallower:
Users that are interested in SoundSwallower are comparing it to the libraries listed below
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated last year
- The EveryVoice TTS Toolkit - Text To Speech for your language☆26Updated this week
- A free & open tool for transcribing audio interviews with offline ASR support☆24Updated last year
- Coqui Inference Engine☆39Updated 3 years ago
- IPA Phonemizer/Dephonemizer for 139 human languages☆25Updated 3 weeks ago
- SEPIA server to support open-source speech recognition via WebSocket connection.☆126Updated 5 months ago
- Unicode Standard tokenization routines and orthography profile segmentation☆37Updated 2 months ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆161Updated last week
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆22Updated last month
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆95Updated 6 months ago
- 🎹 pyannote + 🗒 notebook = pyannotebook☆26Updated last year
- Labeled data for homograph disambiguation☆57Updated last year
- Uses ctypes and libespeak-ng to transform test into IPA phonemes☆20Updated last year
- python code for converting among IPA, ARPABET, XSAMPA, Callhome, DISC, TIMIT, plus some lexical tones.☆33Updated last year
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- webrtcvad provides node.js bindings to the WebRTC voice activity detection library.☆31Updated 4 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆102Updated 2 years ago
- On-device speaker diarization powered by deep learning☆44Updated last month
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated 2 years ago
- A converter from Arpabet to IPA (see https://en.wikipedia.org/wiki/Arpabet)☆16Updated 7 years ago
- ☆17Updated 2 years ago
- ☆36Updated 10 months ago
- Application for viewing Rich Transcription Time Marked (RTTM) files in an interactive way☆42Updated 2 years ago
- Prosodic Speech Segmentation with Transformers☆25Updated last year
- Audiobook alignment for Indigenous languages☆40Updated last week
- Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server.☆28Updated 3 years ago
- Simple Diarization model☆47Updated last year
- Timething is a library for aligning text transcripts with their audio recordings.☆117Updated 5 months ago