neocl / speachLinks
ππ Python 3 library for managing, annotating, and converting natural language corpuses using popular formats (CoNLL, ELAN, Praat, CSV, JSON, SQLite, VTT, Audacity, TTL, TIG, ISF, etc.)
β17Updated last year
Alternatives and similar repositories for speach
Users that are interested in speach are comparing it to the libraries listed below
Sorting:
- Gamma Agreement in Pythonβ45Updated last year
- β22Updated 3 years ago
- Python Finite-State Toolkitβ57Updated last week
- A guide to building language technology in new languages.β58Updated 3 years ago
- A tiny BERT for low-resource monolingual modelsβ31Updated 10 months ago
- β44Updated 3 years ago
- Software for phonetic transcription of English and Finnish, and IPA toolsβ15Updated 9 years ago
- Unicode Standard tokenization routines and orthography profile segmentationβ37Updated 5 months ago
- A repository containing links to useful phonological softwareβ12Updated 2 years ago
- A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammarsβ17Updated last year
- Breaks a word into syllables using an LSTM-based neural network.β20Updated last year
- SIGMORPHON 2022 Shared Task on Morpheme Segmentationβ26Updated 2 years ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)β30Updated last month
- A Python toolkit converting pronunciation in enwiktionary xml dump to cmudict formatβ33Updated 6 years ago
- List of corpora annotated for coreference for different languagesβ17Updated last year
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)β47Updated 2 years ago
- phone inventory libraryβ16Updated 2 years ago
- This repo contains a set of neural transducer, e.g. sequence-to-sequence model, focusing on character-level tasks.β76Updated last year
- SpanAlign: Sentence Alignment Method based on Cross-Language Span Prediction and ILPβ14Updated 4 years ago
- several algorithms for converting dependency structures into constituency structures.β10Updated 3 years ago
- MultiLexNorm 2021 competition system from ΓFALβ15Updated 3 years ago
- Proposed splits for the LREC Wikipron paperβ14Updated 5 years ago
- Featurize words into orthographic and phonological vectors.β41Updated 2 years ago
- Creating super-parallel corpora of more than 1500+ unique languages for NLP researchβ34Updated 2 years ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentationβ197Updated 4 years ago
- Small-vocabulary neural sequence-to-sequence generation with optional feature conditioningβ33Updated last week
- NTREX -- News Test References for MT Evaluationβ85Updated last year
- Spoken Language Identification on Common Voice and AudioSet using Deep Learningβ40Updated 3 years ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forestsβ41Updated 2 years ago
- MAGPIE: A sense-annotated corpus of potentially idiomatic expressionsβ27Updated 5 years ago