finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests
β14Jan 24, 2017Updated 9 years ago
Alternatives and similar repositories for carmel
Users that are interested in carmel are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Randomly sample lines from massive text files efficientlyβ17Apr 1, 2015Updated 11 years ago
- Coqui STT (πΈSTT) based forced alignment toolβ13Feb 24, 2022Updated 4 years ago
- β13Nov 16, 2022Updated 3 years ago
- Symmetrized word alignment models, based on mgizapp and GIZA++β14Jun 23, 2014Updated 11 years ago
- Finite state compiler, processor and helper tools used by apertiumβ20Jan 29, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A free & open tool for transcribing audio interviews with offline ASR supportβ25Dec 21, 2023Updated 2 years ago
- Tool for creating Kaldi nnet3 recipes using the International Phonetic Alphabet (IPA)β10Jun 2, 2021Updated 4 years ago
- Expected edit distance implementation using OpenFst toolsβ11May 13, 2015Updated 10 years ago
- steps to perform text-based speaker diarization with kaldi toolkitβ12Nov 2, 2018Updated 7 years ago
- Automatically exported from code.google.com/p/transducersaurusβ11Apr 1, 2015Updated 11 years ago
- A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based β¦β16Sep 5, 2017Updated 8 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-coreβ15Jun 19, 2023Updated 2 years ago
- Tutorial on {Deep} Phonetic Tools given in BigPhon @ LabPhon15β12Apr 17, 2017Updated 9 years ago
- Source code for "Unsupervised Lexicon Discovery from Acoustic Input ", Lee et al, 2015 TACLβ10Aug 11, 2016Updated 9 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- English Resource Grammarβ27Apr 1, 2026Updated 2 weeks ago
- Proposed splits for the LREC Wikipron paperβ15Apr 7, 2020Updated 6 years ago
- Deepspeech ASR Model for the Catalan Languageβ17Feb 15, 2021Updated 5 years ago
- NLRB data scraper by LexPredictβ12Dec 8, 2022Updated 3 years ago
- Simple Kaldi recipe for forced alignmentβ11Jul 16, 2023Updated 2 years ago
- β13Oct 3, 2024Updated last year
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forestsβ41Oct 14, 2022Updated 3 years ago
- Hosts text-to-speech corpus and speech synthesizers for African languages.β18May 31, 2023Updated 2 years ago
- Speech Dereverberation using weighted prediction errorβ11Dec 22, 2019Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammarsβ17Jun 14, 2024Updated last year
- Kaldi extended by Kaituo XU with new features in nnet1.β12Dec 16, 2018Updated 7 years ago
- Benchmark Arabic text diacritization datasetβ78Apr 7, 2026Updated last week
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speechβ¦β17Mar 6, 2023Updated 3 years ago
- a ducttape workflow for neural machine translationβ14Mar 23, 2021Updated 5 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IWβ¦β18Nov 30, 2022Updated 3 years ago
- TEI-encoded contents of the Egyptian Gazetteβ15Jun 11, 2024Updated last year
- Python wrapper for phonetisaurus grapheme to phoneme toolβ12Mar 11, 2021Updated 5 years ago
- Build OpenFst using ndk-buildβ11Nov 22, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- bilingual dictionary extractor from parallel corporaβ23Jul 3, 2014Updated 11 years ago
- BurrMill coreβ22Nov 2, 2021Updated 4 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.β15May 19, 2020Updated 5 years ago
- A pretrained Wikipedia Doc2Vec models repository. No one did this, so I do.β14May 21, 2020Updated 5 years ago
- Acoustic and language models for minorised languages.β26Sep 30, 2020Updated 5 years ago
- RΓΆttger et al. (ACL 2021): "HateCheck: Functional Tests for Hate Speech Detection Models" - Experimental Codeβ11May 18, 2021Updated 4 years ago
- Public domain corpus of Catalan textβ18Dec 20, 2021Updated 4 years ago