morfologik / polimorfologikLinks
Scripts for preprocessing morfologik data.
☆40Updated 8 years ago
Alternatives and similar repositories for polimorfologik
Users that are interested in polimorfologik are comparing it to the libraries listed below
Sorting:
- Tools for finite state automata construction and dictionary-based morphological dictionaries. Includes Polish stemming dictionary.☆197Updated 2 years ago
- Polish morphological tagger.☆43Updated 2 years ago
- Open morphology for Finnish☆95Updated last week
- Zurich Morphological Lexicon for German: a tool to extract a morphological lexicon from Wiktionary☆12Updated 2 years ago
- Unitex/GramLab Language Resources☆18Updated 3 years ago
- NER tagger for English, Spanish, Dutch, Italian and German and French.☆35Updated 10 years ago
- SMOR (Stuttgart Morphology) with alternative lemmatization component☆13Updated 2 years ago
- A Corpus Data Retrieval Index using Lucene for Look-Ups☆19Updated last week
- Stanford Tregex-inspired language for rule-based dependency tree manipulation.☆21Updated 8 years ago
- Basic dataset for the linguistic data collection.☆15Updated 8 years ago
- GermaNER: Free Open German Named Entity Recognition Tool☆36Updated 2 years ago
- The Zurich Dependency Parser for German☆89Updated 3 months ago
- Some convenient natural language tools that build on NLTK.☆85Updated 11 years ago
- NLTK Contrib☆168Updated last year
- Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipg…☆129Updated 11 months ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆41Updated 3 years ago
- ☆18Updated 10 years ago
- A project for code to create models from existing corpora and distribute models.☆42Updated 13 years ago
- Aelius is a suite of Python, NLTK-based modules and language data for training and evaluating POS-taggers for Brazilian Portuguese and an…☆19Updated 13 years ago
- WordNet-LMF formats☆24Updated 3 weeks ago
- Automatically exported from code.google.com/p/foma☆126Updated 3 months ago
- Transcripts and audio for the Berkeley Restaurant Project (BeRP) speech corpus☆23Updated last year
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆70Updated last month
- Helsinki Finite-State Technology (library and application suite)☆136Updated last month
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆66Updated this week
- ElixirFM Functional Arabic Morphology☆44Updated 2 years ago
- NameTag: Named Entity Tagger☆37Updated last year
- LEPOR: A Robust Evaluation Metric for Machine Translation with Augmented Factors☆16Updated 8 years ago
- Course in Natural Language Processing and Applications☆10Updated 3 years ago
- A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning tech…☆74Updated last year