dumitrescustefan / RO-STS
Romanian Semantic Textual Similarity Dataset
☆16 Updated 2 years ago
Alternatives and similar repositories for RO-STS:
Users who are interested in RO-STS compare it to the repositories listed below.
- This repo is the home of Romanian Transformers. ☆101 Updated 2 years ago
- A novel dataset for emotion detection from Romanian text. ☆17 Updated 2 months ago
- Romanian Named Entity Corpus (RONEC) version 2.0 ☆63 Updated 2 years ago
- Some notebooks for NLP ☆200 Updated last year
- xfspell - the Transformer Spell Checker ☆190 Updated 4 years ago
- Named Entity Recognition for Romanian, based on transformer models ☆13 Updated 3 years ago
- Yet Another Neural Machine Translation Toolkit ☆178 Updated last month
- ☆12 Updated 3 years ago
- MILES is a multilingual text simplifier inspired by LSBert - A BERT-based lexical simplification approach proposed in 2018. Unlike LSBert… ☆49 Updated 3 years ago
- A list of Romanian NLP Datasets ☆42 Updated 2 months ago
- Romanian WordNet (Data + API for Python) ☆51 Updated 4 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish. ☆59 Updated 2 years ago
- ☆110 Updated last year
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models ☆157 Updated 2 years ago
- ☆138 Updated last year
- Polish RoBERTA model trained on Polish literature, Wikipedia, and Oscar. The major assumption is that quality text will give a good mode… ☆34 Updated 3 years ago
- ☆14 Updated 4 years ago
- MAFAND-MT ☆55 Updated 9 months ago
- German Morphological Analyzer ☆47 Updated 3 years ago
- An NLP system for generating reading comprehension questions ☆288 Updated last year
- Text2Text Language Modeling Toolkit ☆300 Updated 3 months ago
- This repository contains a dataset for hate speech detection on social media platforms. ☆71 Updated 2 years ago
- A library to synthesize text datasets using Large Language Models (LLM) ☆152 Updated 2 years ago
- COMBO is a jointly trained tagger, lemmatizer and dependency parser. ☆35 Updated 2 years ago
- LOW-RESOURCE NEURAL MACHINE TRANSLATION: A BENCHMARK FOR FIVE AFRICAN LANGUAGES ☆15 Updated 4 years ago
- Pre-trained, multilingual sequence-to-sequence models for Indian languages ☆46 Updated 2 years ago
- NTREX -- News Test References for MT Evaluation ☆83 Updated 10 months ago
- Ten Thousand German News Articles Dataset for Topic Classification ☆84 Updated 2 years ago
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages ☆73 Updated 2 years ago
- A guide to building language technology in new languages. ☆58 Updated 3 years ago