amir-zeldes / RFTokenizer
A character-wise tokenizer for morphologically rich languages
☆27, updated 4 months ago
Related projects
Alternatives and complementary repositories for RFTokenizer
- An NLP pipeline for Hebrew ☆34, updated 6 months ago
- A part-of-speech tagger with support for domain adaptation and external resources. ☆22, updated 2 years ago
- ConllEditor is a tool to edit dependency syntax trees in CoNLL-U format. ☆54, updated this week
- A tool for automatic spelling normalization ☆20, updated 3 years ago
- Runnable morphological analysis tools from the UniMorph project ☆14, updated 5 years ago
- An advanced, extensible web front-end for the Manatee-open corpus search engine ☆60, updated this week
- German Morphological Analyzer ☆47, updated 2 years ago
- A neural network that jointly part-of-speech tags and lemmatizes sentences, boosting accuracy for morphologically-rich languages (Czech, … ☆34, updated 5 years ago
- Python version for Doug Biber's Multidimensional Analysis (MDA) ☆27, updated 4 months ago
- A simple configurable tool for manipulating dependency trees. ☆13, updated 6 months ago
- Repository for the Georgetown University Multilayer Corpus (GUM) ☆88, updated last week
- Multi Tier Annotation Search ☆26, updated 3 years ago
- Deutsches Lyrik Korpus (DLK) / German Poetry Corpus ☆17, updated 5 months ago
- ANNIS is an open source, versatile web browser-based search and visualization architecture for complex multilevel linguistic corpora with… ☆69, updated this week
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages. ☆34, updated last year
- Annotation tool for coreference ☆32, updated last year
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g… ☆110, updated 4 months ago
- Python framework for processing Universal Dependencies data ☆56, updated last week
- Python Finite-State Toolkit ☆44, updated 3 months ago
- Compiled tools, datasets, and other resources for historical text normalization. ☆16, updated 5 years ago
- A multilingual parallel corpus created from translations of the Bible. ☆175, updated last month
- Python Multilingual Ucrel Semantic Analysis System ☆30, updated 2 months ago
- BERT and ELECTRA models trained on Europeana Newspapers ☆36, updated 2 years ago
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more … ☆111, updated 6 months ago
- The NLG tool for Finnish ☆22, updated 10 months ago
- Python 3 library for processing historical English ☆64, updated 2 months ago
- Poetry Corpora Annotated on Aesthetic Emotions ☆11, updated 2 years ago