neocl / speach
Python 3 library for managing, annotating, and converting natural language corpora using popular formats (CoNLL, ELAN, Praat, CSV, JSON, SQLite, VTT, Audacity, TTL, TIG, ISF, etc.)
☆21 Updated last year
Alternatives and similar repositories for speach
Users that are interested in speach are comparing it to the libraries listed below
- A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammars ☆17 Updated last year
- Python Finite-State Toolkit ☆60 Updated last month
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation ☆31 Updated 2 years ago
- ☆13 Updated 5 years ago
- ☆19 Updated 4 years ago
- A set of neural transducers (e.g. sequence-to-sequence models) focusing on character-level tasks. ☆76 Updated 2 years ago
- An NLP pipeline for Hebrew ☆41 Updated 7 months ago
- A repository containing links to useful phonological software ☆12 Updated 2 years ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation) ☆54 Updated 2 years ago
- Featurize words into orthographic and phonological vectors. ☆41 Updated 2 years ago
- Gamma Agreement in Python ☆45 Updated last year
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language ☆16 Updated last week
- ☆50 Updated last year
- NTREX -- News Test References for MT Evaluation ☆88 Updated last year
- A guide to building language technology in new languages. ☆59 Updated 4 years ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and their 102 different scripts (orthographies) ☆34 Updated 7 months ago
- MAGPIE: A sense-annotated corpus of potentially idiomatic expressions ☆31 Updated 5 years ago
- ☆45 Updated 3 years ago
- Morphological Inflection for Low-Resource Languages using cross-lingual transfer ☆21 Updated 6 years ago
- Unicode Standard tokenization routines and orthography profile segmentation ☆39 Updated 11 months ago
- Bilingual sentence similarity classifier using Tensorflow ☆24 Updated 6 years ago
- A tiny BERT for low-resource monolingual models ☆31 Updated last month
- Small-vocabulary neural sequence-to-sequence generation with optional feature conditioning ☆35 Updated last month
- List of corpora annotated for coreference for different languages ☆17 Updated last year
- ☆22 Updated 3 years ago
- MultiLexNorm 2021 competition system from ÚFAL ☆15 Updated 4 years ago
- A Python toolkit converting pronunciation in enwiktionary xml dump to cmudict format ☆33 Updated 6 years ago
- ☆34 Updated 2 years ago
- Corpus preprocessing ☆99 Updated last year
- English web corpus with 4M tokens and several annotation types ☆26 Updated 2 years ago