dumitrescustefan / RO-STS
Romanian Semantic Textual Similarity Dataset
☆15Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for RO-STS
- This repo is the home of Romanian Transformers.☆93Updated 2 years ago
- Romanian Named Entity Corpus (RONEC) version 2.0☆60Updated last year
- A novel dataset for emotion detection from Romanian text.☆15Updated 2 weeks ago
- A list of Romanian NLP Datasets☆30Updated 3 weeks ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated last year
- ☆48Updated last year
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆76Updated 4 months ago
- Some notebooks for NLP☆187Updated last year
- Named Entity Recognition for Romanian, based on transformer models☆12Updated 2 years ago
- ☆12Updated 3 years ago
- Romanian WordNet (Data + API for Python)☆49Updated 4 years ago
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge.☆66Updated 3 years ago
- BERT model trained from scratch on Finnish☆96Updated 3 years ago
- How good is BERT ? Comparing BERT to other state-of-the-art approaches on a French sentiment analysis dataset☆149Updated last year
- Fine-tune transformers with pytorch-lightning☆44Updated 2 years ago
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.☆107Updated last year
- This is a neural spell checker☆60Updated last year
- xfspell — the Transformer Spell Checker☆187Updated 4 years ago
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13☆157Updated last week
- spaCy + UDPipe☆161Updated 2 years ago
- Unannotated Spanish 3 Billion Words Corpora☆92Updated 2 years ago
- A collection of preprocessed datasets and pretrained models for generating paraphrases.☆29Updated 3 years ago
- A small tool that EXPLains spACY parse results. See what I did there?☆83Updated 2 years ago
- MILES is a multilingual text simplifier inspired by LSBert - A BERT-based lexical simplification approach proposed in 2018. Unlike LSBert…☆48Updated 3 years ago
- Jupyter notebooks that use the Fastai library☆91Updated 3 years ago
- ☆104Updated 10 months ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆91Updated 6 months ago
- List of research and engineering of NLP for American Native/Indigenous Languages.☆87Updated 3 years ago
- Shared BERT model for 4 languages of Bulgarian, Czech, Polish and Russian. Slavic NER model.☆73Updated 2 years ago
- Official repository of the Hate Speech Detection Tasks at Evalita☆12Updated 3 years ago