OnlpLab / NEMO-Corpus
Named Entity (NER) annotations of the Hebrew Treebank (Haaretz newspaper) corpus, including: morpheme and token level NER labels, nested mentions, and more.
☆10Updated 3 years ago
Alternatives and similar repositories for NEMO-Corpus:
Users that are interested in NEMO-Corpus are comparing it to the libraries listed below
- Neural Modeling for Named Entities and Morphology (Hebrew NER)☆31Updated 2 years ago
- ☆18Updated 8 months ago
- An NLP pipeline for Hebrew☆37Updated 3 weeks ago
- Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at…☆22Updated 7 months ago
- Featurize words into orthographic and phonological vectors.☆40Updated last year
- List of corpora annotated for coreference for different languages☆17Updated 7 months ago
- BERT and ELECTRA models trained on Europeana Newspapers☆37Updated 3 years ago
- ☆64Updated 2 years ago
- An easy-to-use library to linguistically compare one sentence and its words to another, in the same language or a different one. For inst…☆22Updated 3 years ago
- Analyze Argumentation and Rhetorical Aspects in Scientific Writing.☆19Updated 2 years ago
- GC4LM: A Colossal (Biased) language model for German☆13Updated 3 years ago
- ☆47Updated 2 years ago
- A tool for text normalisation via character-level machine translation☆13Updated 4 years ago
- A small repository to test Captum Explainable AI with a trained Flair transformers-based text classifier.☆27Updated 3 years ago
- An implementation of GrASP (Shnarch et. al., 2017)☆21Updated 2 years ago
- MAGPIE: A sense-annotated corpus of potentially idiomatic expressions☆26Updated 4 years ago
- Dutch coreference resolution & dialogue analysis using deterministic rules☆21Updated last year
- Data for the HIPE 2022 shared task.☆17Updated last year
- BERT models for many languages created from Wikipedia texts☆33Updated 4 years ago
- several algorithms for converting dependency structures into constituency structures.☆10Updated 3 years ago
- An easy-to-use API for analyzing INCEpTION annotation projects.☆17Updated last year
- Learned string similarity for entity names using optimal transport.☆35Updated 4 years ago
- Simple library to work with pre-trained ELMo models in TensorFlow☆52Updated last year
- A neural network that jointly part-of-speech tags and lemmatizes sentences, boosting accuracy for morphologically-rich languages (Czech, …☆34Updated 5 years ago
- Word Sense Induction with BERT MLM☆28Updated last year
- Datasets for the Monolingual Word Sense Alignment (MWSA) task☆12Updated 4 years ago
- UIMA CAS processing library written in Python☆87Updated last week
- Tool for parsing and converting various span encoding schemes.☆23Updated last year
- Code for pre-training CharacterBERT models (as well as BERT models).☆34Updated 3 years ago
- COMBO is jointly trained tagger, lemmatizer and dependency parser.☆35Updated 2 years ago