An even smaller speech recognizer / force aligner
☆37Mar 31, 2026Updated last month
Alternatives and similar repositories for SoundSwallower
Users that are interested in SoundSwallower are comparing it to the libraries listed below.
- Domain-specific programming language for linguistic grammars and transducers☆17Apr 15, 2026Updated 2 weeks ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Jun 19, 2023Updated 2 years ago
- Audiobook alignment for Indigenous languages☆45Apr 23, 2026Updated last week
- A free & open tool for transcribing audio interviews with offline ASR support☆25Dec 21, 2023Updated 2 years ago
- Kaldi code for doing DNN with tensorflow☆13Feb 8, 2016Updated 10 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 4 years ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆199Updated this week
- GUI application for the Klatt formant synthesizer package☆12Feb 16, 2026Updated 2 months ago
- This repo contains the baseline model recipes and pre-trained model for the GramVanni Hindi ASR challenge☆15Mar 26, 2022Updated 4 years ago
- Utilities for manipulating finite state transducers with the OpenFst library.☆32Sep 22, 2017Updated 8 years ago
- Implementation of Android's TextToSpeechService that provides Estonian text-to-speech☆17Jan 19, 2019Updated 7 years ago
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- Hosts text-to-speech corpus and speech synthesizers for African languages.☆18May 31, 2023Updated 2 years ago
- ACE View is a natural language based ontology and rule editor. ACE View uses Attempto Controlled English (ACE) in the front-end, and Web …☆10Dec 16, 2018Updated 7 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Feb 15, 2024Updated 2 years ago
- ☆33Nov 27, 2021Updated 4 years ago
- A framework for overviewing the performance of F0 estimators☆19Sep 10, 2016Updated 9 years ago
- Expected edit distance implementation using OpenFst tools☆11May 13, 2015Updated 10 years ago
- Automatically exported from code.google.com/p/transducersaurus☆11Apr 1, 2015Updated 11 years ago
- This repository contains the files used for our Interspeech 2017 paper.☆16May 30, 2017Updated 8 years ago
- Estonian text-to-speech text normalization pipeline☆12Dec 17, 2025Updated 4 months ago
- A Python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …☆16Sep 5, 2017Updated 8 years ago
- A family of efficient speech models for multilingual phone recognition☆57Feb 12, 2026Updated 2 months ago
- From a large speech audio file and its corresponding body of text, automatically chunk the audio and text into (phrase, audio_snippet) pa…☆17May 15, 2015Updated 10 years ago
- Tutorial on {Deep} Phonetic Tools given in BigPhon @ LabPhon15☆12Apr 17, 2017Updated 9 years ago
- In this repository, I try to combine k2 with speechbrain for fast, accurate decoding.☆16Jun 17, 2022Updated 3 years ago
- Source code for "Unsupervised Lexicon Discovery from Acoustic Input", Lee et al, 2015 TACL☆10Aug 11, 2016Updated 9 years ago
- Trained deep neural-net models for estimating articulatory keypoints from midsagittal ultrasound tongue videos and front-view lip camera …☆24Jun 13, 2023Updated 2 years ago
- Proposed splits for the LREC Wikipron paper☆15Apr 7, 2020Updated 6 years ago
- Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FST…☆182Apr 21, 2026Updated last week
- Stop-go recording of audio in terminal☆12Feb 7, 2015Updated 11 years ago
- The EveryVoice TTS Toolkit - Text To Speech for your language☆43Updated this week
- ☆55Jan 13, 2023Updated 3 years ago
- ☆16May 8, 2025Updated 11 months ago
- Coqui STT (🐸STT) based forced alignment tool☆13Feb 24, 2022Updated 4 years ago
- Offline Automatic Speech Recognition for Android 2.2+ using Sphinx (specifically PocketSphinx)☆20Jun 22, 2014Updated 11 years ago
- ☆13Nov 16, 2022Updated 3 years ago
- The Data Format for Digital Linguistics (DaFoDiL)☆21Feb 7, 2023Updated 3 years ago
- Simple Kaldi recipe for forced alignment☆11Jul 16, 2023Updated 2 years ago