klintan / swedish-ner-corpus
Small, semi-manually annotated web news corpus in Swedish for CoreNLP NER. Four categories: PER, ORG, LOC, and MISC.
☆11 · Updated 4 years ago
Alternatives and similar repositories for swedish-ner-corpus:
Users who are interested in swedish-ner-corpus are comparing it to the repositories listed below.
- CoNLL-U to Pandas DataFrame ☆31 · Updated 7 years ago
- Research into identifying and correcting incorrect labels in the CoNLL-2003 corpus. ☆12 · Updated 3 years ago
- Repository for rstWeb, a browser-based annotation interface for Rhetorical Structure Theory ☆41 · Updated 2 months ago
- Experiments with Zalando's flair library ☆34 · Updated last year
- The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016… ☆66 · Updated 2 years ago
- Lexicons for the Multilingual UCREL Semantic Analysis System ☆40 · Updated last year
- Linguistic converter / merging tool for multi-level annotated corpora. Graph-based (using Python and NetworkX). ☆50 · Updated last year
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do… ☆78 · Updated 6 months ago
- Swedish data ☆13 · Updated last month
- Toolkit to compile a comparable/parallel corpus from European Parliament proceedings ☆15 · Updated 4 years ago
- Harassment Lexicon and Corpus ☆29 · Updated 6 years ago
- An annotated corpus of argumentative microtexts ☆39 · Updated 2 years ago
- Code for the paper "Simple, Interpretable and Stable Method for Detecting Words with Usage Change across Corpora", ACL 2020. ☆18 · Updated 4 years ago
- A Python wrapper for the multilingual temporal tagger HeidelTime. ☆26 · Updated 2 years ago
- UFSAC is a resource containing all WordNet Sense Annotated Corpora, and a Java library for manipulating them ☆37 · Updated 2 years ago
- Evaluating Text Representations on Lexical Composition ☆24 · Updated 5 years ago
- Mining Discourse Markers for Unsupervised Sentence Representation Learning ☆60 · Updated last year
- Creating crowdsourcing based experiments made easy ☆10 · Updated 4 years ago
- A dataset of atomic Wikipedia edits containing insertions and deletions of a contiguous chunk of text in a sentence. This dataset contai… ☆106 · Updated 5 years ago
- A thin wrapper around the DBpedia Spotlight HTTP API ☆25 · Updated 7 years ago
- A German ELMo deep contextualized word representation, trained on a special German Wikipedia text corpus. ☆28 · Updated 5 years ago
- A package for handy processing of semantic graphs such as AMR, with a special focus on standardized evaluation ☆20 · Updated 3 months ago
- Plan and train German transformer models. ☆23 · Updated 3 years ago
- Mining Argument Structures with Expressive Inference (Linear and LSTM Engines) ☆64 · Updated 7 years ago
- Unsupervised method for extracting quotation-speaker pairs from large news corpora. ☆29 · Updated 6 years ago