AndyTheFactory / romanian-nlp-datasets
A list of Romanian NLP Datasets
☆30Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for romanian-nlp-datasets
- This repo is the home of Romanian Transformers.☆93Updated 2 years ago
- Romanian Semantic Textual Similarity Dataset☆15Updated 2 years ago
- Romanian WordNet (Data + API for Python)☆49Updated 4 years ago
- Romanian Named Entity Corpus (RONEC) version 2.0☆60Updated last year
- A novel dataset for emotion detection from Romanian text.☆15Updated 2 weeks ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated last year
- A module to compute textual lexical richness (aka lexical diversity).☆92Updated last year
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Updated 6 months ago
- Some notebooks for NLP☆187Updated last year
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13☆157Updated last week
- This is a neural spell checker☆60Updated last year
- this is where we share notebooks/projects used in your youtube channel☆147Updated 3 years ago
- Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)☆93Updated this week
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆151Updated 5 months ago
- DaCy: The State of the Art Danish NLP pipeline using SpaCy☆92Updated 2 weeks ago
- ☆147Updated 4 months ago
- Fair Embedding Engine☆12Updated 4 years ago
- CD20200004 from 01/01/2021 to 31/12/2023 - LIG UGA - Python Notebook and Models for the MT Lab @ ALPS 2022☆14Updated 7 months ago
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge.☆66Updated 3 years ago
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.☆96Updated last year
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.☆103Updated 6 months ago
- Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs).☆69Updated 2 months ago
- Entity linking evaluation and analysis tool☆19Updated last week
- A repository with several curated datasets of counter-narratives to fight online hate speech.☆85Updated last year
- A repo to explore different NLP tasks which can be solved using T5☆169Updated 3 years ago
- Sentiment Corpus for Swedish 🇸🇪 Norwegian 🇳🇴 Danish 🇩🇰 Finnish 🇫🇮 (and English 🏴)☆15Updated 3 years ago
- Shared BERT model for 4 languages of Bulgarian, Czech, Polish and Russian. Slavic NER model.☆73Updated 2 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 2 years ago
- Stanford's Alexa Prize socialbot☆131Updated last year
- A Python package to compute HONEST, a score to measure hurtful sentence completions in language models. Published at NAACL 2021.☆20Updated last year