Python Finite-State Toolkit
☆60Mar 5, 2026Updated this week
Alternatives and similar repositories for pyfoma
Users that are interested in pyfoma are comparing it to the libraries listed below
Sorting:
- Camel Morph’s goal is to build large open-source morphological models for Arabic and its dialects across many genres and domains.☆15Dec 8, 2024Updated last year
- VoxAngeles Corpus☆13Aug 23, 2025Updated 6 months ago
- A set of tools for analyzing languages via logic and automata☆27Feb 12, 2026Updated 3 weeks ago
- Automatically exported from code.google.com/p/foma☆129Feb 20, 2026Updated 2 weeks ago
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation☆33Mar 26, 2023Updated 2 years ago
- ☆42Dec 10, 2025Updated 2 months ago
- Domain-specific programming language for linguistic grammars and transducers — Langage dédié pour les grammaires linguistiques et les tra…☆17Updated this week
- Code and data for "Superbizarre Is Not Superb: Derivational Morphology Improves BERT's Interpretation of Complex Words"☆18Aug 17, 2021Updated 4 years ago
- An even smaller speech recognizer / force aligner☆37Dec 16, 2024Updated last year
- The EveryVoice TTS Toolkit - Text To Speech for your language☆43Feb 24, 2026Updated last week
- A family of efficient speech models for multilingual phone recognition☆45Feb 12, 2026Updated 3 weeks ago
- Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FST…☆181Jul 11, 2025Updated 7 months ago
- A tool to collect/validate audio recordings from workers on Amazon Mechanical Turk. Written in Python/Flask. (originally hosted on github…☆14Dec 19, 2022Updated 3 years ago
- Translation of query languages to serialized KoralQuery protocol☆13Feb 23, 2026Updated last week
- Audiobook alignment for Indigenous languages☆45Feb 4, 2026Updated last month
- A free & open tool for transcribing audio interviews with offline ASR support☆25Dec 21, 2023Updated 2 years ago
- The Language Independent Intelligent Dictionary☆26Updated this week
- Next-generation Punkt sentence boundary detection with zero dependencies☆29Nov 18, 2025Updated 3 months ago
- Helsinki Finite-State Technology (library and application suite)☆137Feb 19, 2026Updated 2 weeks ago
- GUI applikation for the Klatt formant synthesizer package☆11Feb 16, 2026Updated 2 weeks ago
- An Easy Annotation Tool for Natural Language Processing☆11May 17, 2024Updated last year
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11May 4, 2022Updated 3 years ago
- ACL Rolling Review website☆11Feb 24, 2026Updated last week
- ☆10Mar 20, 2021Updated 4 years ago
- A python library for easily querying morphological inflection models trained on Unimorph☆13Oct 23, 2022Updated 3 years ago
- AsoSoft Speech Corpus can be used for spoken language processing tasks in Central Kurdish such as speech recognition, speaker recognition…☆10Mar 8, 2022Updated 3 years ago
- SMOR (Stuttgart Morphology) with alternative lemmatization component☆13Aug 10, 2023Updated 2 years ago
- Toy example on how to build a unit selection TTS in Spanish☆11May 10, 2019Updated 6 years ago
- The complete [1 to 5]-gram Gumar Corpus in the style of Google n-grams.☆11Feb 5, 2020Updated 6 years ago
- Evaluation of STT models for german language☆15Jan 22, 2022Updated 4 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- Hua Ki'i static website version for ICLDC 2021. For more information contact rng.wlf@gmail.com☆15Mar 6, 2021Updated 5 years ago
- Benchmark scripts for comparing different tokenizers and sentence segmenters of German☆12Feb 27, 2023Updated 3 years ago
- Simple Kaldi recipe for forced alignment☆11Jul 16, 2023Updated 2 years ago
- The Arabic Error Type Annotation tool aims to annotate Arabic error types following the ALC tagset annotation.☆11Oct 28, 2022Updated 3 years ago
- ☆16Oct 27, 2025Updated 4 months ago
- Thai Grapheme to Phoneme (G2P) Wiktionary Corpus☆13Jul 25, 2022Updated 3 years ago
- ☆14May 24, 2022Updated 3 years ago
- fiwGAN/ciwGAN (Featural and Categorical InfoWaveGAN): Generative Adversarial Phonology and Semantics☆26May 24, 2023Updated 2 years ago