dumitrescustefan / roner
Named Entity Recognition for Romanian, based on transformer models
☆13 · Updated 3 years ago
Alternatives and similar repositories for roner:
Users who are interested in roner compare it to the libraries listed below.
- A novel dataset for emotion detection from Romanian text. ☆17 · Updated 2 months ago
- This repo is the home of Romanian Transformers. ☆101 · Updated 2 years ago
- Romanian Named Entity Corpus (RONEC) version 2.0 ☆63 · Updated 2 years ago
- Romanian Semantic Textual Similarity Dataset ☆16 · Updated 2 years ago
- Ten Thousand German News Articles Dataset for Topic Classification ☆84 · Updated 2 years ago
- Romanian WordNet (Data + API for Python) ☆51 · Updated 4 years ago
- Legal document classification with EuroVoc descriptors on 22 languages. ☆27 · Updated last year
- ParlaMint: Comparable Parliamentary Corpora ☆60 · Updated last month
- Schema for modelling parliamentary debates ☆21 · Updated 2 years ago
- A list of Natural Language Processing Tools for Romanian ☆30 · Updated 4 years ago
- Simple customizable pipeline tool for anonymizing Danish text. ☆10 · Updated 7 months ago
- Conversational Agent Research Toolkit ☆12 · Updated 11 months ago
- The curation repository for the data behind Concepticon. ☆38 · Updated 2 months ago
- (WIP) a platform for language documentation ☆39 · Updated 4 months ago
- A tool for automatic spelling normalization ☆20 · Updated 4 years ago
- Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German ☆478 · Updated 5 months ago
- A tool to extract canonical references from text. ☆20 · Updated 3 years ago
- Norwegian Speech Transformer Models ☆18 · Updated 5 months ago
- Datasets for fake news and misinformation detection ☆66 · Updated last year
- GerVADER - A German adaptation of the VADER sentiment analysis tool for social media texts ☆25 · Updated 2 years ago
- Python Multilingual Ucrel Semantic Analysis System ☆31 · Updated 8 months ago
- A character-wise tokenizer for morphologically rich languages ☆27 · Updated last month
- The home repository of the NerKor corpus, a Hungarian gold standard named entity annotated corpus containing 1 million tokens. ☆15 · Updated last year
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models ☆157 · Updated 2 years ago
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more … ☆112 · Updated 11 months ago
- Command Line Interface (CLI) to export METS/ALTO documents to other formats. ☆13 · Updated 3 years ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency ☆154 · Updated 5 months ago
- A lemmatizer for German language text ☆89 · Updated 2 years ago
- Implementation and helper scripts for the BART-TL model - https://www.aclweb.org/anthology/2021.eacl-main.121/ ☆16 · Updated 3 years ago
- Jupyter notebooks for course "Computational Morphology with HFST". ☆18 · Updated 2 years ago