Alegzandra / Romanian-NLP-toolsLinks
A list of Natural Language Processing Tools for Romanian
☆31Updated 4 years ago
Alternatives and similar repositories for Romanian-NLP-tools
Users that are interested in Romanian-NLP-tools are comparing it to the libraries listed below
Sorting:
- A novel dataset for emotion detection from Romanian text.☆19Updated 3 months ago
- This repo is the home of Romanian Transformers.☆103Updated 2 years ago
- Romanian Word Embeddings. Here you can find pre-trained corpora of word embeddings. Current methods: CBOW, Skip-Gram, Fast-Text (from Gen…☆12Updated last month
- A list of Romanian NLP Datasets☆48Updated 3 months ago
- Named Entity Recognition for Romanian, based on transformer models☆13Updated 3 years ago
- Romanian Named Entity Corpus (RONEC) version 2.0☆64Updated 2 years ago
- Romanian WordNet (Data + API for Python)☆52Updated 4 years ago
- Fixes contractions such as `you're` to `you are`☆318Updated 2 years ago
- Romanian Semantic Textual Similarity Dataset☆16Updated 2 years ago
- Framework for fine-tuning pretrained transformers for Named-Entity Recognition (NER) tasks☆159Updated 2 years ago
- Applying BERT to named entity recognition in English and Russian.☆162Updated 2 years ago
- A multilingual lexicon of words to hurt.☆89Updated 7 months ago
- BERT classification model for processing texts longer than 512 tokens. Text is first divided into smaller chunks and after feeding them t…☆142Updated 11 months ago
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13☆180Updated this week
- A module to compute textual lexical richness (aka lexical diversity).☆108Updated last year
- ✔️Contextual word checker for better suggestions (not actively maintained)☆413Updated 4 months ago
- Linguistic and stylistic complexity measures for (literary) texts☆81Updated last year
- Ten Thousand German News Articles Dataset for Topic Classification☆84Updated 2 years ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆246Updated 2 years ago
- Train Spacy ner with custom dataset☆183Updated 2 years ago
- NeuSpell: A Neural Spelling Correction Toolkit☆695Updated last year
- ☆161Updated 11 months ago
- A Dataset of German Legal Documents for Named Entity Recognition☆169Updated 2 years ago
- Google USE (Universal Sentence Encoder) for spaCy☆184Updated 2 years ago
- Custom Named Entity Recognition with Spacy3☆31Updated 3 years ago
- Some notebooks for NLP☆204Updated last year
- UIMA CAS processing library written in Python☆89Updated 2 months ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 2 years ago
- Repository for TweetEval☆376Updated 2 years ago
- The official tool for transforming doccano format into common dataset formats.☆107Updated 2 years ago