dumitrescustefan / ronecLinks
Romanian Named Entity Corpus (RONEC) version 2.0
☆65Updated 2 years ago
Alternatives and similar repositories for ronec
Users that are interested in ronec are comparing it to the libraries listed below
Sorting:
- This repo is the home of Romanian Transformers.☆103Updated 2 years ago
- A novel dataset for emotion detection from Romanian text.☆20Updated 4 months ago
- Neural based model for automatic diacritics restoration.☆25Updated 6 years ago
- Romanian WordNet (Data + API for Python)☆52Updated 4 years ago
- Named Entity Recognition for Romanian, based on transformer models☆13Updated 3 years ago
- A list of Natural Language Processing Tools for Romanian☆31Updated 4 years ago
- ☆50Updated 2 years ago
- Linguistic and stylistic complexity measures for (literary) texts☆81Updated last year
- Romanian Semantic Textual Similarity Dataset☆16Updated 2 years ago
- Lexical database for ~70k English words with morphological variables☆44Updated 3 years ago
- UDPipe: Trainable pipeline for tokenizing, tagging, lemmatizing and parsing Universal Treebanks and other CoNLL-U files☆380Updated 7 months ago
- A sentence segmenter that actually works!☆307Updated 4 years ago
- Ten Thousand German News Articles Dataset for Topic Classification☆84Updated 2 years ago
- 🤘Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪☆77Updated 3 years ago
- CONLL-U to Pandas DataFrame☆31Updated 7 years ago
- German Morphological Analyzer☆47Updated 3 years ago
- Text tokenization and sentence segmentation (segtok v2)☆205Updated 3 years ago
- RoBERTa models for Polish☆87Updated 3 years ago
- This is a monolingual English corpus of native, non-native and (human) translated texts extracted from the European Parliament.☆9Updated 3 years ago
- Dutch coreference resolution & dialogue analysis using deterministic rules☆21Updated 2 years ago
- Repository for the word embeddings experiments described in "Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource", pre…☆83Updated 4 years ago
- 🏖TagEditor - Annotation tool for spaCy☆192Updated 2 years ago
- A Dutch RoBERTa-based language model☆205Updated last year
- ✔️Contextual word checker for better suggestions (not actively maintained)☆414Updated 4 months ago
- Polish morphological tagger.☆43Updated 2 years ago
- Multi Tier Annotation Search☆26Updated 4 years ago
- Named Entity Recognition (LSTM + CRF + FastText) with models for [historic] German☆26Updated 4 years ago
- Compound splitter for German☆107Updated 5 years ago
- BERT and ELECTRA models trained on Europeana Newspapers☆38Updated 3 years ago
- A module to compute textual lexical richness (aka lexical diversity).