ReadAlongs/SoundSwallower

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ReadAlongs/SoundSwallower)

ReadAlongs / SoundSwallower

An even smaller speech recognizer / force aligner

☆36

Alternatives and similar repositories for SoundSwallower

Users that are interested in SoundSwallower are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

nrc-cnrc / gramble
View on GitHub
Domain-specific programming language for linguistic grammars and transducers — Langage dédié pour les grammaires linguistiques et les tra…
☆17Updated this week
ReadAlongs / Studio
View on GitHub
Audiobook alignment for Indigenous languages
☆45Jun 26, 2026Updated 3 weeks ago
tiro-is / tiro-speech-core
View on GitHub
This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core
☆15Jun 19, 2023Updated 3 years ago
ReadAlongs / Studio-Web
View on GitHub
Suite of web packages for creating interactive ReadAlongs
☆16Updated this week
projecte-aina / oTranscribe-plus
View on GitHub
A free & open tool for transcribing audio interviews with offline ASR support
☆25Dec 21, 2023Updated 2 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
miccio-dk / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Apr 13, 2022Updated 4 years ago
NRC-ILT / g2p
View on GitHub
Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!
☆203Jul 10, 2026Updated last week
gpu-poor / gramvaani_hindi_asr
View on GitHub
This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge
☆16Mar 26, 2022Updated 4 years ago
benob / openfst-utils
View on GitHub
Utilities for manipulating finite state transducers with the OpenFst library.
☆32Sep 22, 2017Updated 8 years ago
mhulden / pyfoma
View on GitHub
Python Finite-State Toolkit
☆68Updated this week
Kaljurand / EKISpeak
View on GitHub
Implementation of Android's TextToSpeechService that provides Estonian text-to-speech
☆17Jan 19, 2019Updated 7 years ago
jessuni / SafeColor
View on GitHub
Generate a consistent color from a string, or generate a random color from a given color. Both accessible, contrast safe, WCAG success cr…
☆16Apr 27, 2021Updated 5 years ago
Hannes1 / react-native-wenet
View on GitHub
Wenet speech to text for react native
☆10Nov 1, 2022Updated 3 years ago
xinjli / transphone
View on GitHub
phoneme tokenizer and grapheme-to-phoneme model for 8k languages
☆174Jun 9, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
egorsmkv / asr-corpus-creator
View on GitHub
This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.
☆27Feb 15, 2024Updated 2 years ago
daanzu / wenet_stt_python
View on GitHub
☆33Nov 27, 2021Updated 4 years ago
dogancan / expected-edit-distance
View on GitHub
Expected edit distance implementation using OpenFst tools
☆11May 13, 2015Updated 11 years ago
Kaljurand / aceview
View on GitHub
ACE View is a natural language based ontology and rule editor. ACE View uses Attempto Controlled English (ACE) in the front-end, and Web …
☆10Dec 16, 2018Updated 7 years ago
markusdr / transducersaurus
View on GitHub
Automatically exported from code.google.com/p/transducersaurus
☆11Apr 1, 2015Updated 11 years ago
mmorise / tusk
View on GitHub
A framework for overviewing the performance of F0 estimators
☆19Sep 10, 2016Updated 9 years ago
bajibabu / GlottGAN
View on GitHub
This repository contains the files used for our Interspeech 2017 paper.
☆16May 30, 2017Updated 9 years ago
motazsaad / ara-pronunciation-tool
View on GitHub
A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …
☆15Sep 5, 2017Updated 8 years ago
patyork / AutomaticSpeechChunker
View on GitHub
From a large speech audio file and its corresponding body of text, automatically chunk the audio and text into (phrase, audio_snippet) pa…
☆17May 15, 2015Updated 11 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
TartuNLP / tts_preprocess_et
View on GitHub
Estonian text-to-speech text normalization pipeline
☆14Dec 17, 2025Updated 7 months ago
MLSpeech / DeepPhoneticToolsTutorial
View on GitHub
Tutorial on {Deep} Phonetic Tools given in BigPhon @ LabPhon15
☆12Apr 17, 2017Updated 9 years ago
luomingshuang / k2-speechbrain
View on GitHub
In this repository, I try to combine k2 with speechbrain to decode well and fastly.
☆16Jun 17, 2022Updated 4 years ago
digitallinguistics / data-format
View on GitHub
The Data Format for Digital Linguistics (DaFoDiL)
☆21Feb 7, 2023Updated 3 years ago
jacquelineCelia / lexicon_discovery
View on GitHub
Source code for "Unsupervised Lexicon Discovery from Acoustic Input ", Lee et al, 2015 TACL
☆10Aug 11, 2016Updated 9 years ago
articulateinstruments / DeepLabCut-for-Speech-Production
View on GitHub
Trained deep neural-net models for estimating articulatory keypoints from midsagittal ultrasound tongue videos and front-view lip camera …
☆25Jun 13, 2023Updated 3 years ago
CUNY-CL / wikipron-modeling
View on GitHub
Proposed splits for the LREC Wikipron paper
☆15Apr 7, 2020Updated 6 years ago
createthis / diffcalculia
View on GitHub
☆16May 8, 2025Updated last year
MiniXC / LightningFastSpeech2
View on GitHub
☆55Jan 13, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ucam-smt / ucam-smt
View on GitHub
Cambridge SMT System
☆18Aug 1, 2017Updated 8 years ago
gullabi / STT-align
View on GitHub
Coqui STT (🐸STT) based forced alignment tool
☆13Feb 24, 2022Updated 4 years ago
skinahan / DIVA_PyTorch
View on GitHub
Implementation of the DIVA model of speech acquisition and production using PyTorch
☆23Jan 18, 2023Updated 3 years ago
EveryVoiceTTS / EveryVoice
View on GitHub
The EveryVoice TTS Toolkit - Text To Speech for your language
☆43Updated this week
shbhrsaha / dictaphone
View on GitHub
Stop-go recording of audio in terminal
☆12Feb 7, 2015Updated 11 years ago
TalnUPF / praat_web
View on GitHub
☆13Jun 30, 2026Updated 3 weeks ago
brendano / parseviz
View on GitHub
Visualize constituent and dependency parses as PDF or image formats, through GraphViz.
☆32Feb 11, 2021Updated 5 years ago