neocl / speach
Python 3 library for managing, annotating, and converting natural language corpuses using popular formats (CoNLL, ELAN, Praat, CSV, JSON, SQLite, VTT, Audacity, TTL, TIG, ISF, etc.)
☆20 · Updated last year
Alternatives and similar repositories for speach
Users who are interested in speach are comparing it to the libraries listed below.
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation ☆29 · Updated 2 years ago
- Python Finite-State Toolkit ☆60 · Updated last week
- This repo contains a set of neural transducers, e.g. sequence-to-sequence models, focusing on character-level tasks. ☆76 · Updated 2 years ago
- Gamma Agreement in Python ☆45 · Updated last year
- A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammars ☆17 · Updated last year
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation) ☆52 · Updated 2 years ago
- ☆19 · Updated 4 years ago
- Featurize words into orthographic and phonological vectors. ☆41 · Updated 2 years ago
- Small-vocabulary neural sequence-to-sequence generation with optional feature conditioning ☆34 · Updated last month
- An NLP pipeline for Hebrew ☆40 · Updated 5 months ago
- A tiny BERT for low-resource monolingual models ☆31 · Updated 2 months ago
- MAGPIE: A sense-annotated corpus of potentially idiomatic expressions ☆29 · Updated 5 years ago
- A guide to building language technology in new languages. ☆59 · Updated 3 years ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation ☆197 · Updated 5 years ago
- AfroLID, a powerful neural toolkit for African language identification which covers 517 African languages. ☆34 · Updated 8 months ago
- ☆45 · Updated 3 years ago
- List of corpora annotated for coreference for different languages ☆17 · Updated last year
- ☆50 · Updated last year
- A minimal, pure Python library to interface with CoNLL-U format files. ☆152 · Updated this week
- ☆12 · Updated 5 years ago
- A Python toolkit converting pronunciation in the enwiktionary XML dump to cmudict format ☆33 · Updated 6 years ago
- ☆22 · Updated 3 years ago
- Forced Alignments for Common Voice ☆31 · Updated 5 years ago
- German Morphological Analyzer ☆50 · Updated 4 years ago
- MultiLexNorm 2021 competition system from ÚFAL ☆15 · Updated 3 years ago
- Unicode Standard tokenization routines and orthography profile segmentation ☆38 · Updated 9 months ago
- Multilingual Open Text ☆25 · Updated 6 months ago
- STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses) ☆68 · Updated 2 weeks ago
- OpusFilter - Parallel corpus processing toolkit ☆112 · Updated 2 weeks ago
- Software for creating speech recognition models. ☆159 · Updated last year