jcblaisecruz02 / Filipino-Text-Benchmarks
Open-source benchmark datasets and pretrained transformer models in the Filipino language.
☆63Updated last year
Alternatives and similar repositories for Filipino-Text-Benchmarks
Users who are interested in Filipino-Text-Benchmarks compare it to the repositories listed below.
- Dataset for Emotion Recognition Research☆216Updated 2 years ago
- ☆115Updated 2 months ago
- Catalog of abusive language data (PLoS 2020)☆321Updated last year
- AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African languages: https://afrisenti-semeval.github.io/☆49Updated last year
- A module to compute textual lexical richness (aka lexical diversity).☆110Updated 2 years ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆112Updated last year
- How good is BERT? Comparing BERT to other state-of-the-art approaches on a French sentiment analysis dataset☆157Updated 2 years ago
- Open datasets for sentiment analysis based on tweets in English/Spanish/French/German/Italian☆75Updated 2 years ago
- Asent is a Python library for performing efficient and transparent sentiment analysis using spaCy.☆120Updated 2 months ago
- This repository contains a dataset for hate speech detection on social media platforms.☆74Updated 3 years ago
- Datasets for Hate Speech Detection☆134Updated 2 years ago
- Visualizations and helpers to improve and debug machine learning models for Rasa Open Source☆311Updated 3 years ago
- Contextual word checker for better suggestions (not actively maintained)☆418Updated 10 months ago
- Fixes contractions such as `you're` to `you are`☆318Updated 3 years ago
- This repository contains the code, data, and models of the paper titled "XL-Sum: Large-Scale Multilingual Abstractive Summarization for 4…☆276Updated last year
- This directory gathers the tools developed by the Data Sourcing Working Group☆31Updated 4 years ago
- A multilingual lexicon of words to hurt.☆92Updated 2 months ago
- MobileBERT and DistilBERT for extractive summarization☆92Updated 2 years ago
- A lightweight Python library for constructing, processing, and visualizing constituent trees.☆68Updated last month
- Applying BERT to named entity recognition in English and Russian.☆162Updated 3 years ago
- Repository for XLM-T, a framework for evaluating multilingual language models on Twitter data☆160Updated 2 years ago
- A Python library for calculating a large variety of metrics from text☆359Updated last year
- Aksharamukha Python Library☆55Updated 10 months ago
- Fake news detection in Filipino via Multitask Transfer Learning☆17Updated last year
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆52Updated 2 years ago
- Unannotated Spanish 3 Billion Words Corpora☆105Updated 3 years ago
- Python library to use Google Transliterate API which powers the G Input Tools☆22Updated 4 years ago
- Pipeline to generate the Standardized Project Gutenberg Corpus☆203Updated last year
- Some notebooks for NLP☆207Updated 2 years ago
- Hate speech dataset from Stormfront forum manually labelled at sentence level.☆175Updated 5 years ago