dumitrescustefan / roner
Named Entity Recognition for Romanian, based on transformer models
☆12Updated 2 years ago
Alternatives and similar repositories for roner:
Users that are interested in roner are comparing it to the libraries listed below
- A novel dataset for emotion detection from Romanian text.☆17Updated 2 months ago
- This repo is the home of Romanian Transformers.☆98Updated 2 years ago
- Romanian Named Entity Corpus (RONEC) version 2.0☆61Updated 2 years ago
- Simple customizable pipeline tool for anonymizing Danish text.☆9Updated 4 months ago
- Schema for modelling parliamentary debates☆21Updated 2 years ago
- ParlaMint: Comparable Parliamentary Corpora☆54Updated this week
- A part-of-speech tagger with support for domain adaptation and external resources.☆22Updated 2 years ago
- A tool for automatic spelling normalization☆20Updated 4 years ago
- A tool to extract canonical references from text.☆20Updated 3 years ago
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆22Updated last year
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆151Updated 2 months ago
- A Python library for topic modeling and visualization☆64Updated 4 years ago
- Norwegian Speech Transformer Models☆17Updated 2 months ago
- Main repository for all code and data related to Visual Analytics (F24)☆16Updated 8 months ago
- Morphological analyzer and lemmatizer for Latin.☆25Updated 2 months ago
- Cyber Hate detection And tracking on Social mEdia☆31Updated 2 years ago
- The Mueller Report Corpus V 0.1☆11Updated 4 years ago
- Yet another search platform for linguistic corpora.☆20Updated this week
- ☆10Updated 4 years ago
- Compound splitter for German language ("Komposita-Zerlegung") based on large dictionary combined with highly efficient multi-pattern stri…☆22Updated 2 years ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆156Updated 2 years ago
- A software to detect text reuse with BLAST.☆14Updated 5 years ago
- A Corpus Data Retrieval Index using Lucene for Look-Ups☆17Updated this week
- ☆11Updated 7 months ago
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more …☆112Updated 8 months ago
- Legal document classification with EuroVoc descriptors on 22 languages.☆25Updated last year
- Extension for pie to include taggers with their models and pre/postprocessors☆10Updated 7 months ago
- Python library to parse Apertium stream format☆13Updated 2 years ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆66Updated last month
- An R package for analysis of dramatic texts☆15Updated 2 years ago