An even smaller speech recognizer / force aligner
☆36May 5, 2026Updated last month
Alternatives and similar repositories for SoundSwallower
Users who are interested in SoundSwallower are comparing it to the libraries listed below.
- Domain-specific programming language for linguistic grammars and transducers☆17Updated this week
- Audiobook alignment for Indigenous languages☆45Updated this week
- Suite of web packages for creating interactive ReadAlongs☆17Updated this week
- A free & open tool for transcribing audio interviews with offline ASR support☆25Dec 21, 2023Updated 2 years ago
- Kaldi code for doing DNN with tensorflow☆13Feb 8, 2016Updated 10 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 4 years ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆201Updated this week
- GUI application for the Klatt formant synthesizer package☆13May 24, 2026Updated 2 weeks ago
- This repo contains the baseline model recipes and pre-trained model for the GramVanni Hindi ASR challenge☆16Mar 26, 2022Updated 4 years ago
- Utilities for manipulating finite state transducers with the OpenFst library.☆32Sep 22, 2017Updated 8 years ago
- Python Finite-State Toolkit☆67May 27, 2026Updated 2 weeks ago
- Implementation of Android's TextToSpeechService that provides Estonian text-to-speech☆17Jan 19, 2019Updated 7 years ago
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- Hosts text-to-speech corpus and speech synthesizers for African languages.☆18May 31, 2023Updated 3 years ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆174Jun 9, 2023Updated 3 years ago
- ACE View is a natural language based ontology and rule editor. ACE View uses Attempto Controlled English (ACE) in the front-end, and Web …☆10Dec 16, 2018Updated 7 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Feb 15, 2024Updated 2 years ago
- ☆33Nov 27, 2021Updated 4 years ago
- A framework for overviewing the performance of F0 estimators☆19Sep 10, 2016Updated 9 years ago
- Expected edit distance implementation using OpenFst tools☆11May 13, 2015Updated 11 years ago
- Automatically exported from code.google.com/p/transducersaurus☆11Apr 1, 2015Updated 11 years ago
- Object detection... for fursuits!☆15Jan 12, 2023Updated 3 years ago
- This repository contains the files used for our Interspeech 2017 paper.☆16May 30, 2017Updated 9 years ago
- Estonian text-to-speech text normalization pipeline☆13Dec 17, 2025Updated 5 months ago
- A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …☆16Sep 5, 2017Updated 8 years ago
- A family of efficient speech models for multilingual phone recognition☆59Feb 12, 2026Updated 3 months ago
- From a large speech audio file and its corresponding body of text, automatically chunk the audio and text into (phrase, audio_snippet) pa…☆17May 15, 2015Updated 11 years ago
- Tutorial on {Deep} Phonetic Tools given in BigPhon @ LabPhon15☆12Apr 17, 2017Updated 9 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and quickly.☆16Jun 17, 2022Updated 3 years ago
- Source code for "Unsupervised Lexicon Discovery from Acoustic Input", Lee et al., 2015 TACL☆10Aug 11, 2016Updated 9 years ago
- Trained deep neural-net models for estimating articulatory keypoints from midsagittal ultrasound tongue videos and front-view lip camera …☆25Jun 13, 2023Updated 2 years ago
- Proposed splits for the LREC Wikipron paper☆15Apr 7, 2020Updated 6 years ago
- Stop-go recording of audio in terminal☆12Feb 7, 2015Updated 11 years ago
- A retro InstallShield screen spoof☆10Sep 21, 2018Updated 7 years ago
- Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FST…☆184Apr 21, 2026Updated last month
- The EveryVoice TTS Toolkit - Text To Speech for your language☆44Updated this week
- ☆55Jan 13, 2023Updated 3 years ago
- ☆16May 8, 2025Updated last year
- Coqui STT (🐸STT) based forced alignment tool☆13Feb 24, 2022Updated 4 years ago