ReadAlongs / SoundSwallowerLinks
An even smaller speech recognizer / force aligner
☆35Updated 7 months ago
Alternatives and similar repositories for SoundSwallower
Users that are interested in SoundSwallower are comparing it to the libraries listed below
Sorting:
- The EveryVoice TTS Toolkit - Text To Speech for your language☆38Updated this week
- IPA Phonemizer/Dephonemizer for 139 human languages☆31Updated this week
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated 2 years ago
- Labeled data for homograph disambiguation☆59Updated 2 years ago
- Convert Arpabet to IPA. Arpabet is the set of phonemes used by the CMU Pronouncing Dictionary. IPA is the International Phonetic Alphabet…☆44Updated 5 years ago
- Coqui Inference Engine☆40Updated 4 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆116Updated 2 years ago
- A free & open tool for transcribing audio interviews with offline ASR support☆25Updated last year
- Audiobook alignment for Indigenous languages☆40Updated 2 weeks ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- Web app for keyword spotting using TensorflowJS☆73Updated 2 years ago
- Unicode Standard tokenization routines and orthography profile segmentation☆37Updated 5 months ago
- Tracking states of the arts and recent results (bibliography) on sound tasks.☆32Updated 2 years ago
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- Launch your speech synthesis within one minute.☆12Updated last year
- SEPIA server to support open-source speech recognition via WebSocket connection.☆128Updated 9 months ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆35Updated 5 years ago
- Evaluation of STT models for german language☆15Updated 3 years ago
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.☆35Updated 2 years ago
- ☆12Updated 2 years ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆25Updated 4 months ago
- OCTRA is a web-application for the orthographic transcription of audio files.☆39Updated last week
- Live Audio MFCC Visualization in the browser using Web Audio API - https://pulakk.github.io/Live-Audio-MFCC/tutorial☆41Updated 5 years ago
- Timething is a library for aligning text transcripts with their audio recordings.☆122Updated 8 months ago
- ☆22Updated 4 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆101Updated 2 years ago
- INACTIVE - http://mzl.la/ghe-archive - Tools to create ARPA models from cmu pocketsphinx dictionaries for proper g2p generation☆21Updated 6 years ago
- A collection of utilities for handling IPA phones.☆25Updated last year
- On-device voice activity detection (VAD) powered by deep learning☆223Updated this week
- Uses ctypes and libespeak-ng to transform test into IPA phonemes☆21Updated last year