mattia-decao / hiero-transformerLinks
☆12Updated last year
Alternatives and similar repositories for hiero-transformer
Users that are interested in hiero-transformer are comparing it to the libraries listed below
Sorting:
- Latin BERT☆69Updated last year
- [JOHD 23] This repository hosts the code to get the artifects of Cuneiform in the paper CuneiML: A Cuneiform Dataset for Machine Learning…☆15Updated last year
- Linguistic Reconstruction with LingPy☆15Updated last year
- Building an effective preprocessing tool for African languages☆13Updated last year
- Python 3 library for processing historical English☆67Updated last year
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆112Updated last year
- ☆32Updated last year
- Machine Learning for Ancient Languages☆31Updated last year
- Data, Embeddings, Stopword lists, code, and baselines for COLING 2020 paper titled "KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text …☆13Updated last year
- ☆12Updated 6 years ago
- BERT and ELECTRA models trained on Europeana Newspapers☆38Updated 4 years ago
- COMET for African languages☆10Updated 11 months ago
- MAFAND-MT☆60Updated last year
- Layout Analysis Dataset with Segmonto (LADaS)☆23Updated 5 months ago
- Data for the HIPE 2022 shared task.☆21Updated 2 years ago
- A part-of-speech tagger with support for domain adaptation and external resources.☆23Updated 3 years ago
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆80Updated 3 years ago
- Masakhane Web is a translation web application for solely African Languages.☆37Updated 2 years ago
- Collatinus Python Lemmatizer☆10Updated 4 years ago
- Programming for Historians☆16Updated 3 years ago
- An advanced, extensible web front-end for the Manatee-open corpus search engine☆78Updated 3 weeks ago
- An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Span…☆87Updated 2 months ago
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni…☆12Updated 2 years ago
- ☆23Updated 2 years ago
- This is a repository for NaijaSenti. A Lacuna Funded Project for the development of sentiment corpus for four Nigerian languages: Igbo, H…☆35Updated 2 months ago
- Pipeline to generate the Standardized Project Gutenberg Corpus☆206Updated 2 years ago
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.☆35Updated 9 months ago
- A French Lemmatizer in Python based on the LEFFF☆42Updated 5 years ago
- Compiled tools, datasets, and other resources for historical text normalization.☆20Updated 6 years ago
- ☆66Updated 4 months ago