jcblaisecruz02 / Filipino-Text-BenchmarksLinks
Open-source benchmark datasets and pretrained transformer models in the Filipino language.
☆63Updated last year
Alternatives and similar repositories for Filipino-Text-Benchmarks
Users that are interested in Filipino-Text-Benchmarks are comparing it to the libraries listed below
Sorting:
- Dataset for Emotion Recognition Research☆216Updated 2 years ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆112Updated last year
- AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African languages : https://afrisenti-semeval.github.io/☆49Updated last year
- ☆115Updated last month
- Text Summarization for Research Papers☆78Updated 3 years ago
- This repository contains a dataset for hate speech detection on social media platforms.☆74Updated 2 years ago
- A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.☆93Updated 10 months ago
- Catalog of abusive language data (PLoS 2020)☆320Updated last year
- Datasets for Hate Speech Detection☆133Updated 2 years ago
- Tagalog Words Stemmer using Python☆30Updated 2 years ago
- A lightweight Python library for constructing, processing, and visualizing constituent trees.☆68Updated last week
- A multilingual lexicon of words to hurt.☆92Updated last month
- A module to compute textual lexical richness (aka lexical diversity).☆110Updated 2 years ago
- A collection of preprocessed datasets and pretrained models for generating paraphrases.☆31Updated 4 years ago
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.☆110Updated 2 years ago
- This repository holds the code for working with data from counselchat.com☆172Updated 2 years ago
- Abstractive and Extractive Text summarization using Transformers.☆86Updated 2 years ago
- MobileBERT and DistilBERT for extractive summarization☆91Updated 2 years ago
- An NLP system for generating reading comprehension questions☆297Updated last year
- A data set and model for german sentiment classification.☆68Updated 5 months ago
- This is a simple Python package for calculating a variety of lexical diversity indices☆81Updated 2 years ago
- How to extract sentiment from opinions without any labels☆138Updated 3 years ago
- XED multilingual emotion datasets☆64Updated 2 years ago
- Hate speech dataset from Stormfront forum manually labelled at sentence level.☆176Updated 5 years ago
- Some notebooks for NLP☆207Updated 2 years ago
- A paraphrase generator built using the T5 model which produces paraphrased English sentences.☆318Updated 3 weeks ago
- TUFS Asian Language Parallel Corpus☆51Updated 2 years ago
- A dataset for Indonesian Named Entity Recognizer☆30Updated 4 years ago
- More than 43+ collections of Thai Natural Language Processing libraries. Update daily.☆30Updated 7 years ago
- The first large-scale summarization corpus for the Indonesian language. AACL 2020.☆38Updated 4 years ago