dumitrescustefan / ronec
Romanian Named Entity Corpus (RONEC) version 2.0
☆61Updated 2 years ago
Alternatives and similar repositories for ronec:
Users that are interested in ronec are comparing it to the libraries listed below
- This repo is the home of Romanian Transformers.☆98Updated 2 years ago
- A novel dataset for emotion detection from Romanian text.☆17Updated 2 months ago
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models☆155Updated 2 years ago
- Named Entity Recognition for Romanian, based on transformer models☆12Updated 2 years ago
- spaCy + UDPipe☆161Updated 2 years ago
- Named Entity Recognition (LSTM + CRF + FastText) with models for [historic] German☆26Updated 3 years ago
- German Morphological Analyzer☆47Updated 3 years ago
- BERTje is a Dutch pre-trained BERT model developed at the University of Groningen. (EMNLP Findings 2020) "What’s so special about BERT’s …☆135Updated last year
- Romanian WordNet (Data + API for Python)☆49Updated 4 years ago
- Ten Thousand German News Articles Dataset for Topic Classification☆84Updated 2 years ago
- Linguistic and stylistic complexity measures for (literary) texts☆79Updated 11 months ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 2 years ago
- Compound splitter for German language ("Komposita-Zerlegung") based on large dictionary combined with highly efficient multi-pattern stri…☆22Updated 2 years ago
- Repository for the word embeddings experiments described in "Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource", pre…☆82Updated 3 years ago
- Simple customizable pipeline tool for anonymizing Danish text.☆9Updated 4 months ago
- Anonymization of legal cases (Fr) based on Flair embeddings☆87Updated 4 years ago
- Dutch coreference resolution & dialogue analysis using deterministic rules☆21Updated last year
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation☆187Updated 4 years ago
- ☆44Updated 5 months ago
- A tokenizer and sentence splitter for German and English web and social media texts.☆137Updated last month
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆22Updated last year
- DaCy: The State of the Art Danish NLP pipeline using SpaCy☆94Updated 3 weeks ago
- A part-of-speech tagger with support for domain adaptation and external resources.☆22Updated 2 years ago
- General-Purpose Neural Networks for Sentence Boundary Detection☆73Updated last year
- UIMA CAS processing library written in Python☆86Updated 8 months ago
- ✔️Contextual word checker for better suggestions (not actively maintained)☆412Updated last month
- A sentence segmenter that actually works!☆303Updated 4 years ago
- A character-level BERT for Ancient Greek☆10Updated last year
- BERT and ELECTRA models trained on Europeana Newspapers☆37Updated 3 years ago
- ☆18Updated last week