dumitrescustefan / RO-STS
Romanian Semantic Textual Similarity Dataset
☆16Updated 2 years ago
Alternatives and similar repositories for RO-STS:
Users that are interested in RO-STS are comparing it to the libraries listed below
- This repo is the home of Romanian Transformers.☆101Updated 2 years ago
- A novel dataset for emotion detection from Romanian text.☆17Updated last month
- Romanian Named Entity Corpus (RONEC) version 2.0☆62Updated 2 years ago
- Polish RoBERTA model trained on Polish literature, Wikipedia, and Oscar. The major assumption is that quality text will give a good mode…☆34Updated 3 years ago
- Romanian WordNet (Data + API for Python)☆51Updated 4 years ago
- Some notebooks for NLP☆199Updated last year
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 2 years ago
- BERT model trained from scratch on Finnish☆96Updated 3 years ago
- Named Entity Recognition for Romanian, based on transformer models☆13Updated 3 years ago
- Clustering sentence embeddings to extract message intent☆173Updated 3 years ago
- A french sequence to sequence pretrained model☆59Updated 2 years ago
- MAFAND-MT☆55Updated 8 months ago
- ☆50Updated 2 years ago
- Crosslingual Question Answering for African Languages☆29Updated 6 months ago
- ☆31Updated 6 years ago
- A curated list of resources such as tools and datasets useful for the processing of Slovak language☆19Updated 2 weeks ago
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13☆171Updated last week
- Shared BERT model for 4 languages of Bulgarian, Czech, Polish and Russian. Slavic NER model.☆73Updated 3 years ago
- LASER multilingual sentence embeddings as a pip package☆224Updated last year
- This is a neural spell checker☆65Updated 2 years ago
- A Dutch RoBERTa-based language model☆199Updated 11 months ago
- XAI Tutorial for the Explainable AI track in the ALPS winter school 2021☆58Updated 4 years ago
- Evaluation of Sentence Representations in Polish☆22Updated 2 years ago
- A list of Romanian NLP Datasets☆41Updated last month
- A Python library for calculating a large variety of metrics from text☆334Updated 3 months ago
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models☆156Updated 2 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models☆31Updated 3 years ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Updated 2 years ago
- ☆12Updated 3 years ago
- ☆14Updated 4 years ago