jcblaisecruz02 / Filipino-Text-BenchmarksLinks
Open-source benchmark datasets and pretrained transformer models in the Filipino language.
☆63Updated last year
Alternatives and similar repositories for Filipino-Text-Benchmarks
Users that are interested in Filipino-Text-Benchmarks are comparing it to the libraries listed below
Sorting:
- This repository contains a dataset for hate speech detection on social media platforms.☆74Updated 2 years ago
- AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African languages : https://afrisenti-semeval.github.io/☆49Updated last year
- A module to compute textual lexical richness (aka lexical diversity).☆110Updated 2 years ago
- Dataset for Emotion Recognition Research☆214Updated 2 years ago
- Class for Aspect-term extraction and Aspect-based sentiment analysis with BERT and Adapters☆44Updated 3 years ago
- Catalog of abusive language data (PLoS 2020)☆316Updated last year
- ☆111Updated last year
- How good is BERT ? Comparing BERT to other state-of-the-art approaches on a French sentiment analysis dataset☆156Updated 2 years ago
- Datasets for Hate Speech Detection☆131Updated 2 years ago
- Hate speech dataset from Stormfront forum manually labelled at sentence level.☆175Updated 5 years ago
- Fake news detection in Filipino via Multitask Transfer Learning☆16Updated last year
- A lightweight Python library for constructing, processing, and visualizing constituent trees.☆67Updated 8 months ago
- A data set and model for german sentiment classification.☆67Updated 3 months ago
- This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The tex…☆55Updated 5 years ago
- How to extract sentiment from opinions without any labels☆139Updated 3 years ago
- MobileBERT and DistilBERT for extractive summarization☆92Updated 2 years ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆111Updated last year
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆48Updated 2 years ago
- Unannotated Spanish 3 Billion Words Corpora☆104Updated 2 years ago
- ☆151Updated 2 years ago
- Official repository of the Hate Speech Detection Tasks at Evalita☆12Updated 4 years ago
- A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.☆89Updated 8 months ago
- Detect toxic spans in toxic texts☆70Updated 2 years ago
- Testing and training detection models for emoji-based hate speech.☆24Updated 3 years ago
- A multilingual lexicon of words to hurt.☆90Updated 2 months ago
- This repository contains EmoBank, a large-scale text corpus manually annotated with emotion according to the psychological Valence-Arousa…☆213Updated 2 years ago
- The SentiWordNet sentiment lexicon☆332Updated 3 years ago
- XED multilingual emotion datasets☆62Updated 2 years ago
- We gather Malaysian dataset! https://malaysian-dataset.readthedocs.io/☆324Updated 3 weeks ago
- This is a simple Python package for calculating a variety of lexical diversity indices☆79Updated 2 years ago