dumitrescustefan / ronec
Romanian Named Entity Corpus (RONEC) version 2.0
☆60, updated 2 years ago
Related projects
Alternatives and complementary repositories for ronec
- This repo is the home of Romanian Transformers. ☆93, updated 2 years ago
- Named Entity Recognition for Romanian, based on transformer models ☆12, updated 2 years ago
- Romanian Semantic Textual Similarity Dataset ☆15, updated 2 years ago
- BERTje is a Dutch pre-trained BERT model developed at the University of Groningen. (EMNLP Findings 2020) "What’s so special about BERT’s … ☆135, updated last year
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation ☆185, updated 4 years ago
- Repository for the word embeddings experiments described in "Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource", pre… ☆82, updated 3 years ago
- Simple customizable pipeline tool for anonymizing Danish text. ☆9, updated 2 months ago
- Romanian WordNet (Data + API for Python) ☆49, updated 4 years ago
- Named Entity Recognition (LSTM + CRF + FastText) with models for [historic] German ☆26, updated 3 years ago
- Text tokenization and sentence segmentation (segtok v2) ☆203, updated 2 years ago
- 🚀 GUI for training spaCy models ☆53, updated 3 years ago
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more … ☆112, updated 6 months ago
- A Greek edition of BERT pre-trained language model ☆142, updated 3 months ago
- coFR: COreference resolution tool for FRench (and singletons). ☆24, updated 4 years ago
- Compound splitter for German ☆103, updated 4 years ago
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models ☆155, updated last year
- 🤘 Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪 ☆75, updated 3 years ago
- A Dutch RoBERTa-based language model ☆197, updated 7 months ago
- Ten Thousand German News Articles Dataset for Topic Classification ☆84, updated 2 years ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency ☆144, updated this week
- BERT model trained from scratch on Finnish ☆96, updated 3 years ago
- A sentence segmenter that actually works! ☆302, updated 4 years ago
- spaCy + UDPipe ☆161, updated 2 years ago
- Named Entity Recognition data for Europeana Newspapers ☆173, updated last year
- German Morphological Analyzer ☆47, updated 3 years ago
- Information extraction from English and German texts based on predicate logic ☆389, updated 2 years ago
- Deutsches Lyrik Korpus (DLK) / German Poetry Corpus ☆17, updated 6 months ago
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation ☆24, updated last year
- A tokenizer and sentence splitter for German and English web and social media texts. ☆135, updated 3 months ago
- 🏖 TagEditor - Annotation tool for spaCy ☆187, updated 2 years ago