nytud / NYTK-NerKor
The home repository of the NerKor corpus, a gold-standard Hungarian corpus annotated for named entities and containing 1 million tokens.
☆14 · Updated last year
Alternatives and similar repositories for NYTK-NerKor:
Users who are interested in NYTK-NerKor are comparing it to the repositories listed below:
- ☆25 · Updated 4 years ago
- Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at… ☆22 · Updated 3 months ago
- CONLL-U to Pandas DataFrame ☆31 · Updated 7 years ago
- Repository for the Georgetown University Multilayer Corpus (GUM) ☆88 · Updated 3 weeks ago
- ☆16 · Updated 5 years ago
- e-magyar text processing system -- inter-module communication via tsv + REST API ☆27 · Updated 11 months ago
- Jupyter extension to visualize dependency structures ☆28 · Updated 6 years ago
- A Named-Entity Recogniser based on Grobid. ☆49 · Updated 2 months ago
- Tools for compiling corpora from Common Crawl ☆12 · Updated this week
- Multi Tier Annotation Search ☆26 · Updated 3 years ago
- Use spaCy for NLP and output to the FoLiA XML format. ☆12 · Updated 8 months ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do… ☆76 · Updated 4 months ago
- Lexicons for the Multilingual UCREL Semantic Analysis System ☆39 · Updated last year
- linguistic converter / merging tool for multi-level annotated corpora. graph-based (using Python and NetworkX). ☆50 · Updated last year
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu… ☆61 · Updated 6 months ago
- CoNLL 2018 Shared Task Team UDPipe-Future ☆39 · Updated 4 years ago
- BERT and ELECTRA models trained on Europeana Newspapers ☆36 · Updated 2 years ago
- Preliminary spaCy models for Latin ☆14 · Updated 2 years ago
- ☆64 · Updated last year
- An unsupervised compound splitter ☆40 · Updated 5 years ago
- C++ implementation of Generalised Brown clustering and python scripts for feature generation ☆41 · Updated 8 years ago
- A simple configurable tool for manipulating dependency trees. ☆13 · Updated 6 months ago
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni… ☆11 · Updated 11 months ago
- REMERGE - Multi-Word Expression discovery algorithm ☆14 · Updated 2 years ago
- Python framework for processing Universal Dependencies data ☆57 · Updated this week
- Wiktionary parser tool for many language editions. ☆53 · Updated 2 years ago
- Repository for rstWeb, a browser based annotation interface for Rhetorical Structure Theory ☆41 · Updated last month
- Unsupervised method for extracting quotation-speaker pairs from large news corpora. ☆28 · Updated 6 years ago
- A thin wrapper around the DBpedia Spotlight HTTP API ☆24 · Updated 6 years ago
- ConllEditor is a tool to edit dependency syntax trees in CoNLL-U format. ☆54 · Updated 2 weeks ago