jcblaisecruz02 / Filipino-Text-Benchmarks
Open-source benchmark datasets and pretrained transformer models in the Filipino language.
☆61 Updated 8 months ago
Alternatives and similar repositories for Filipino-Text-Benchmarks:
Users who are interested in Filipino-Text-Benchmarks are comparing it to the repositories listed below.
- Fake news detection in Filipino via Multitask Transfer Learning ☆14 Updated 8 months ago
- A module to compute textual lexical richness (aka lexical diversity). ☆106 Updated last year
- AfroLID, a powerful neural toolkit for African language identification which covers 517 African languages. ☆31 Updated last month
- A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures. ☆81 Updated 3 months ago
- Dataset for Emotion Recognition Research ☆210 Updated 2 years ago
- Hate speech dataset from Stormfront forum manually labelled at sentence level. ☆171 Updated 4 years ago
- Tagalog Words Stemmer using Python ☆28 Updated last year
- AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African languages: https://afrisenti-semeval.github.io/ ☆48 Updated last year
- Testing and training detection models for emoji-based hate speech. ☆23 Updated 2 years ago
- Official repository of the Hate Speech Detection Tasks at Evalita ☆12 Updated 4 years ago
- Machine learning models from Singapore's NLP research community ☆35 Updated 2 years ago
- Datasets for fake news and misinformation detection ☆66 Updated last year
- Repository for XLM-T, a framework for evaluating multilingual language models on Twitter data ☆150 Updated 2 years ago
- A multilingual lexicon of words to hurt. ☆89 Updated 5 months ago
- This repository contains a dataset for hate speech detection on social media platforms. ☆71 Updated 2 years ago
- Package to extract connotation frames ☆85 Updated last year
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets. ☆109 Updated last year
- HateEval 2019 - Task 5 ☆17 Updated 6 years ago
- A repository with several curated datasets of counter-narratives to fight online hate speech. ☆88 Updated last year
- NusaWrites is an in-depth analysis of corpora collection strategy and a comprehensive language modeling benchmark for underrepresented an… ☆25 Updated 7 months ago
- MobileBERT and DistilBERT for extractive summarization ☆89 Updated last year
- Multilingual and monolingual corpora focused on Caucasus languages for Natural Language Processing (NLP) ☆35 Updated 4 months ago
- Catalog of abusive language data (PLoS 2020) ☆309 Updated 10 months ago
- XED multilingual emotion datasets ☆58 Updated last year
- Benchmarking Multidomain English-Indonesian Machine Translation ☆16 Updated 4 years ago
- This repository contains papers and resources pertaining to hate speech research.