neocl / speachLinks
🐍🍑 Python 3 library for managing, annotating, and converting natural language corpuses using popular formats (CoNLL, ELAN, Praat, CSV, JSON, SQLite, VTT, Audacity, TTL, TIG, ISF, etc.)
☆20Updated last year
Alternatives and similar repositories for speach
Users that are interested in speach are comparing it to the libraries listed below
Sorting:
- ☆22Updated 3 years ago
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation☆28Updated 2 years ago
- A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammars☆17Updated last year
- Python Finite-State Toolkit☆59Updated this week
- This repo contains a set of neural transducer, e.g. sequence-to-sequence model, focusing on character-level tasks.☆76Updated 2 years ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆31Updated 4 months ago
- ☆19Updated 4 years ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆51Updated 2 years ago
- Unicode Standard tokenization routines and orthography profile segmentation☆37Updated 8 months ago
- phone inventory library☆17Updated 2 years ago
- Gamma Agreement in Python☆45Updated last year
- An NLP pipeline for Hebrew☆39Updated 4 months ago
- A tiny BERT for low-resource monolingual models☆31Updated last month
- A Python toolkit converting pronunciation in enwiktionary xml dump to cmudict format☆33Updated 6 years ago
- Featurize words into orthographic and phonological vectors.☆41Updated 2 years ago
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)☆75Updated 7 months ago
- Small-vocabulary neural sequence-to-sequence generation with optional feature conditioning☆34Updated 2 weeks ago
- An open-source framework for modeling real-time conversations in spoken dialogue systems.☆27Updated 3 years ago
- List of corpora annotated for coreference for different languages☆17Updated last year
- A guide to building language technology in new languages.☆59Updated 3 years ago
- SpanAlign: Sentence Alignment Method based on Cross-Language Span Prediction and ILP☆14Updated 4 years ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation☆197Updated 5 years ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11Updated 3 years ago
- Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation☆15Updated last year
- Corpus preprocessing☆99Updated last year
- ☆49Updated last year
- several algorithms for converting dependency structures into constituency structures.☆10Updated 3 years ago
- ☆10Updated 4 years ago
- These are lists for a variety of languages containing words that are distinctive to each language.☆38Updated 3 years ago
- Morphological Inflection for Low-Resource Languages using cross-lingual transfer☆20Updated 5 years ago