antonisa / unimorph_inflect
A Python library for easily querying morphological inflection models trained on UniMorph
☆11Updated last year
Related projects:
- Runnable morphological analysis tools from the UniMorph project☆14Updated 5 years ago
- Statistics on multilingual datasets☆17Updated 2 years ago
- Multilingual Open Text☆25Updated 5 months ago
- GC4LM: A Colossal (Biased) language model for German☆13Updated 3 years ago
- Data and scripts for the proper evaluation of cross-lingual embeddings in multiple languages☆13Updated 4 years ago
- GlotScript: A Resource and Tool for Low Resource Writing System Identification -- LREC 2024☆13Updated 3 months ago
- A survey of corpora for Germanic low-resource languages and dialects☆24Updated last month
- UFSAC is a resource containing all WordNet Sense Annotated Corpora, and a Java library for manipulating them☆37Updated 2 years ago
- A repository for the 2022 Inflection Shared Task☆9Updated 2 years ago
- Scripts for compatibilitising between VISL-CG3, Apertium, CoNLL-X and Universal Dependencies☆14Updated 4 years ago
- Creating super-parallel corpora covering more than 1,500 unique languages for NLP research☆32Updated last year
- A simple neural truecaser written in PyTorch and AllenNLP.☆31Updated 3 months ago
- Compiled tools, datasets, and other resources for historical text normalization.☆16Updated 5 years ago
- Several algorithms for converting dependency structures into constituency structures.☆9Updated 2 years ago
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆21Updated last year
- Automatically harvested multilingual contrastive word sense disambiguation test sets for machine translation☆16Updated 3 years ago
- These are lists for a variety of languages containing words that are distinctive to each language.☆34Updated 2 years ago
- SMOR (Stuttgart Morphology) with alternative lemmatization component☆11Updated last year
- A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammars☆15Updated 3 months ago
- Python Finite-State Toolkit☆39Updated last month
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆30Updated 2 years ago
- A neural network that jointly part-of-speech tags and lemmatizes sentences, boosting accuracy for morphologically-rich languages (Czech, …☆34Updated 5 years ago
- CoNLL 2018 Shared Task Team UDPipe-Future☆39Updated 3 years ago
- AfroLID, a neural toolkit for African language identification that covers 517 African languages.☆27Updated last year