dumitrescustefan / ronecLinks
Romanian Named Entity Corpus (RONEC) version 2.0
☆65Updated 3 years ago
Alternatives and similar repositories for ronec
Users that are interested in ronec are comparing it to the libraries listed below
Sorting:
- This repo is the home of Romanian Transformers.☆108Updated 3 years ago
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models☆156Updated 3 years ago
- ✔️Contextual word checker for better suggestions (not actively maintained)☆418Updated 10 months ago
- Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German☆511Updated last year
- A sentence segmenter that actually works!☆304Updated 5 years ago
- A Dutch RoBERTa-based language model☆207Updated last year
- BERTje is a Dutch pre-trained BERT model developed at the University of Groningen. (EMNLP Findings 2020) "What’s so special about BERT’s …☆141Updated 2 years ago
- A tokenizer and sentence splitter for German and English web and social media texts.☆150Updated last year
- ☆50Updated last year
- Text tokenization and sentence segmentation (segtok v2)☆208Updated 3 years ago
- Compound splitter for German☆110Updated 5 years ago
- Ten Thousand German News Articles Dataset for Topic Classification☆86Updated 3 years ago
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.☆34Updated 9 months ago
- Shared BERT model for 4 languages of Bulgarian, Czech, Polish and Russian. Slavic NER model.☆78Updated 3 years ago
- A french sequence to sequence pretrained model☆62Updated 3 years ago
- Information extraction from English and German texts based on predicate logic☆392Updated 3 years ago
- Compound splitter for German language ("Komposita-Zerlegung") based on large dictionary combined with highly efficient multi-pattern stri…☆34Updated 3 years ago
- Natural Language Processing Pipeline - Sentence Splitting, Tokenization, Lemmatization, Part-of-speech Tagging and Dependency Parsing☆562Updated last year
- RoBERTa models for Polish☆89Updated 3 years ago
- Linguistic and stylistic complexity measures for (literary) texts☆84Updated last year
- Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An All-Round Python Library for Transformer…☆395Updated 2 years ago
- UDPipe: Trainable pipeline for tokenizing, tagging, lemmatizing and parsing Universal Treebanks and other CoNLL-U files☆390Updated 2 weeks ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆256Updated 3 years ago
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more …☆115Updated last year
- 110k Dutch Book Reviews Dataset for Sentiment Analysis☆29Updated 2 years ago
- DaNLP is a repository for Natural Language Processing resources for the Danish Language.☆207Updated 10 months ago
- Applying BERT to named entity recognition in English and Russian.☆162Updated 3 years ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆244Updated 2 years ago
- A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.☆319Updated 2 weeks ago
- A Dataset of German Legal Documents for Named Entity Recognition☆172Updated 3 years ago