dumitrescustefan / roner
Named Entity Recognition for Romanian, based on transformer models
☆12Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for roner
- A novel dataset for emotion detection from Romanian text.☆15Updated 2 weeks ago
- This repo is the home of Romanian Transformers.☆93Updated 2 years ago
- Romanian Named Entity Corpus (RONEC) version 2.0☆60Updated last year
- 110k Dutch Book Reviews Dataset for Sentiment Analysis☆30Updated last year
- A part-of-speech tagger with support for domain adaptation and external resources.☆22Updated 2 years ago
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more …☆111Updated 6 months ago
- Python Multilingual Ucrel Semantic Analysis System☆30Updated 2 months ago
- spaCy + UDPipe☆161Updated 2 years ago
- Legal document classification with EuroVoc descriptors on 22 languages.☆25Updated last year
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆144Updated this week
- Ten Thousand German News Articles Dataset for Topic Classification☆84Updated 2 years ago
- Python version for Doug Biber's Multidimensional Analysis (MDA)☆27Updated 4 months ago
- ConllEditor is a tool to edit dependency syntax trees in CoNLL-U format.☆54Updated this week
- A character-wise tokenizer for morphologically rich languages☆27Updated 4 months ago
- A tokenizer and sentence splitter for German and English web and social media texts.☆135Updated 3 months ago
- Romanian WordNet (Data + API for Python)☆49Updated 4 years ago
- Dutch coreference resolution & dialogue analysis using deterministic rules☆21Updated last year
- Python 3 library for processing historical English☆64Updated 3 months ago
- 🤘Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪☆75Updated 3 years ago
- Linguistic and stylistic complexity measures for (literary) texts☆77Updated 9 months ago
- An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Span…☆70Updated last week
- LaMachine - A software distribution of our in-house as well as some 3rd party NLP software - Virtual Machine, Docker, or local compilatio…☆68Updated last year
- ParlaMint: Comparable Parliamentary Corpora☆45Updated 3 weeks ago
- The Mueller Report Corpus V 0.1☆11Updated 4 years ago
- An extensive Python library for dealing with FoLiA (Format for Linguistic Annotation) documents, a rich XML-based format for linguistic a…☆18Updated last month
- Romanian Semantic Textual Similarity Dataset☆15Updated 2 years ago
- This is a new backend implementation of the ANNIS linguistic search and visualization system.☆17Updated last month
- linguistic converter / merging tool for multi-level annotated corpora. graph-based (using Python and NetworkX).☆50Updated last year
- Featurize words into orthographic and phonological vectors.☆40Updated last year
- Norwegian Speech Transformer Models☆17Updated this week