UnitexGramLab / unitex-core
Unitex/GramLab C++ Core
☆22Updated 11 months ago
Related projects ⓘ
Alternatives and complementary repositories for unitex-core
- Unitex/GramLab Language Resources☆20Updated 2 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆61Updated 6 months ago
- Unitex/GramLab Java IDE☆13Updated 4 months ago
- Master repo for the UniMorph project, includes the UniMorph schema and annotated data files☆27Updated 5 years ago
- A cloud-based, open-source system for writing and publishing dictionaries.☆86Updated 10 months ago
- Multi Tier Annotation Search☆26Updated 3 years ago
- A Corpus Data Retrieval Index using Lucene for Look-Ups☆16Updated this week
- A tool for automatic spelling normalization☆20Updated 3 years ago
- Full Stack of Latvian Language Resources for Natural Language Understanding (NLU) and Generation (NLG)☆14Updated 2 years ago
- The Global WordNet Association Collaborative Inter-Lingual Index☆40Updated 2 weeks ago
- Various utilities for processing the data.☆207Updated this week
- Python for Linguists – a Gentle Introduction to Programming☆44Updated 8 years ago
- ANNIS is an open source, versatile web browser-based search and visualization architecture for complex multilevel linguistic corpora with…☆69Updated this week
- Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipg…☆124Updated this week
- AUTOTYP data export☆40Updated last year
- ConllEditor is a tool to edit dependency syntax trees in CoNLL-U format.☆54Updated last week
- linguistics backend☆40Updated last year
- An advanced, extensible web front-end for the Manatee-open corpus search engine☆61Updated this week
- Official releases of the PROIEL treebank of ancient Indo-European languages☆36Updated last year
- Some examples of usage of Grobid in a third party java project.☆18Updated last year
- Multi Tier Annotation Search☆12Updated 6 months ago
- This packages up data for the Open Multilingual Wordnet☆43Updated 3 weeks ago
- Automatically exported from code.google.com/p/hunpos☆11Updated 6 years ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆65Updated this week
- Detect and align similar passages☆88Updated 2 months ago
- A set of workflows for corpus building through OCR, post-correction and normalisation☆48Updated 2 years ago
- ☆63Updated 6 months ago
- The curation repository for the data behind Concepticon.☆34Updated this week
- LingPy: Python library for quantitative tasks in historical linguistics☆125Updated 11 months ago
- Extension of the mate-tools NLP pipeline☆67Updated 8 years ago