Coqui STT (πΈSTT) based forced alignment tool
β13Feb 24, 2022Updated 4 years ago
Alternatives and similar repositories for STT-align
Users that are interested in STT-align are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- β13Nov 16, 2022Updated 3 years ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forestsβ15Jan 24, 2017Updated 9 years ago
- A free & open tool for transcribing audio interviews with offline ASR supportβ25Dec 21, 2023Updated 2 years ago
- Tool for creating Kaldi nnet3 recipes using the International Phonetic Alphabet (IPA)β10Jun 2, 2021Updated 5 years ago
- Expected edit distance implementation using OpenFst toolsβ11May 13, 2015Updated 11 years ago
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- steps to perform text-based speaker diarization with kaldi toolkitβ12Nov 2, 2018Updated 7 years ago
- Generate an accurate, timestamped transcript given an audio file and its text using Google Cloud's Speech-to-Text API via gRPC.β21Aug 16, 2020Updated 5 years ago
- A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based β¦β16Sep 5, 2017Updated 8 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-coreβ15Jun 19, 2023Updated 2 years ago
- Tutorial on {Deep} Phonetic Tools given in BigPhon @ LabPhon15β12Apr 17, 2017Updated 9 years ago
- Source code for "Unsupervised Lexicon Discovery from Acoustic Input ", Lee et al, 2015 TACLβ10Aug 11, 2016Updated 9 years ago
- Proposed splits for the LREC Wikipron paperβ15Apr 7, 2020Updated 6 years ago
- Deepspeech ASR Model for the Catalan Languageβ17Feb 15, 2021Updated 5 years ago
- Simple Kaldi recipe for forced alignmentβ11Jul 16, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off β’ AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- β14Jun 16, 2023Updated 2 years ago
- scipts for working with open.bible dataβ26Jan 24, 2022Updated 4 years ago
- bin filesβ13Jan 30, 2025Updated last year
- Python wrapper for phonetisaurus grapheme to phoneme toolβ12Mar 11, 2021Updated 5 years ago
- Phonetically-Oriented Word Error Rateβ36May 4, 2019Updated 7 years ago
- Datasets for machine translationβ10Jul 5, 2019Updated 6 years ago
- DeepSpeech based forced alignment toolβ239Dec 12, 2020Updated 5 years ago
- GlottDNN vocoder and tools for training DNN excitation modelsβ33Feb 27, 2021Updated 5 years ago
- Neural ngram language model in PyTorch.β10Sep 27, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A GPU language model, based on btree backed tries.β30Mar 6, 2018Updated 8 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.β15May 19, 2020Updated 6 years ago
- Acoustic and language models for minorised languages.β26Sep 30, 2020Updated 5 years ago
- β15Apr 15, 2016Updated 10 years ago
- Public domain corpus of Catalan textβ18Dec 20, 2021Updated 4 years ago
- Pytorch code of "A new automatic speech recognizer for Brazilian Portuguese based on deep neural networks and transfer learning" submitteβ¦β21Sep 30, 2019Updated 6 years ago
- A baseline Automatic Speech Recognition system for Polish based on Kaldi.β18Dec 21, 2021Updated 4 years ago
- Port of Python's pdfminer to Lispβ15Jan 30, 2016Updated 10 years ago
- Deep Multi-Speech modelβ11Jul 25, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A handy dataset of noises for ASRβ22May 29, 2019Updated 7 years ago
- EESEN based offline transcriber VM using models trained on TEDLIUM and Cantab Researchβ50Jun 4, 2019Updated 7 years ago
- This repository provides UNOFFICIAL Bunched LPCNet implementations with Pytorch.β14Jun 17, 2021Updated 4 years ago
- β37Mar 26, 2024Updated 2 years ago
- scripts used for SMT system submitted to WMT 2014β12Apr 30, 2017Updated 9 years ago
- Web dashboard for discord bots, in development.β11Feb 2, 2018Updated 8 years ago
- Linguistic processing for Common Voiceβ59Jan 18, 2024Updated 2 years ago