neocl / speachLinks
ππ Python 3 library for managing, annotating, and converting natural language corpuses using popular formats (CoNLL, ELAN, Praat, CSV, JSON, SQLite, VTT, Audacity, TTL, TIG, ISF, etc.)
β19Updated last year
Alternatives and similar repositories for speach
Users that are interested in speach are comparing it to the libraries listed below
Sorting:
- Python Finite-State Toolkitβ58Updated this week
- A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammarsβ17Updated last year
- SIGMORPHON 2022 Shared Task on Morpheme Segmentationβ28Updated 2 years ago
- A tiny BERT for low-resource monolingual modelsβ31Updated 11 months ago
- β12Updated 5 years ago
- A guide to building language technology in new languages.β59Updated 3 years ago
- Gamma Agreement in Pythonβ45Updated last year
- This repo contains a set of neural transducer, e.g. sequence-to-sequence model, focusing on character-level tasks.β76Updated 2 years ago
- β49Updated last year
- β45Updated 3 years ago
- β22Updated 3 years ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)β29Updated 2 months ago
- A Python toolkit converting pronunciation in enwiktionary xml dump to cmudict formatβ33Updated 6 years ago
- Small-vocabulary neural sequence-to-sequence generation with optional feature conditioningβ34Updated last month
- List of corpora annotated for coreference for different languagesβ17Updated last year
- Unicode Standard tokenization routines and orthography profile segmentationβ37Updated 7 months ago
- An easy-to-use library to extract indices from texts.β29Updated 4 years ago
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree languageβ16Updated this week
- Corpus preprocessingβ98Updated last year
- several algorithms for converting dependency structures into constituency structures.β10Updated 3 years ago
- β19Updated 3 years ago
- MultiLexNorm 2021 competition system from ΓFALβ15Updated 3 years ago
- These are lists for a variety of languages containing words that are distinctive to each language.β38Updated 3 years ago
- MAGPIE: A sense-annotated corpus of potentially idiomatic expressionsβ27Updated 5 years ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)β47Updated 2 years ago
- An NLP pipeline for Hebrewβ39Updated 3 months ago
- NTREX -- News Test References for MT Evaluationβ85Updated last year
- A python true casing utility that restores case information for textsβ89Updated 2 years ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentationβ197Updated 4 years ago
- A simple neural truecaser written in pytorch and allennlp.β33Updated last year