jcblaisecruz02/Filipino-Text-Benchmarks

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jcblaisecruz02/Filipino-Text-Benchmarks)

jcblaisecruz02 / Filipino-Text-Benchmarks

Open-source benchmark datasets and pretrained transformer models in the Filipino language.

☆67

Alternatives and similar repositories for Filipino-Text-Benchmarks

Users that are interested in Filipino-Text-Benchmarks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jcblaisecruz02 / Tagalog-fake-news
View on GitHub
Fake news detection in Filipino via Multitask Transfer Learning
☆17Aug 26, 2024Updated last year
matthewgo / FilipinoStanfordPOSTagger
View on GitHub
☆12Apr 26, 2020Updated 6 years ago
crlwingen / TagalogStemmerPython
View on GitHub
Tagalog Words Stemmer using Python
☆30May 21, 2023Updated 3 years ago
benhur07b / covid19ph-doh-data-dump
View on GitHub
Data Dump (CSV) of COVID-19 Data from the Philippines' Department of Health
☆14Apr 23, 2020Updated 6 years ago
BuzzFeedNews / whtranscripts
View on GitHub
Fetch and parse the American Presidency Project's press-briefing and presidential-news-conference transcripts.
☆11Aug 18, 2016Updated 9 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ljvmiranda921 / calamanCy
View on GitHub
NLP pipelines for Tagalog using spaCy
☆71Jul 6, 2026Updated 2 weeks ago
williamscott701 / Cross-SEAN
View on GitHub
☆11Aug 25, 2021Updated 4 years ago
jhellingman / phildict
View on GitHub
Repository for Philippine language dictionary data
☆22Feb 4, 2023Updated 3 years ago
gpu-poor / gramvaani_hindi_asr
View on GitHub
This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge
☆16Mar 26, 2022Updated 4 years ago
anaistack / cefr-asag-corpus
View on GitHub
A corpus of short answers written by learners of English and graded with CEFR levels
☆12Dec 17, 2021Updated 4 years ago
GeorgeVern / lmcor
View on GitHub
Code for the EACL 2024 paper: "Small Language Models Improve Giants by Rewriting Their Outputs"
☆12Apr 20, 2024Updated 2 years ago
PyThaiNLP / thai-g2p-wiktionary-corpus
View on GitHub
Thai Grapheme to Phoneme (G2P) Wiktionary Corpus
☆13Jul 25, 2022Updated 4 years ago
LaSTUS-TALN-UPF / TSAR-2022-Shared-Task
View on GitHub
TSAR2022 Shared Task on Lexical Simplification - Datasets and Evaluation scripts
☆10Oct 27, 2022Updated 3 years ago
EricEchemane / Soil-Type-Classifier-Through-Image-Processing
View on GitHub
Soil Type Classification Through Image Processing and Machine Learning This project is in development process for our Thesis Project.
☆10Oct 9, 2022Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
wikifactcheck-english / wikifactcheck-english
View on GitHub
Data and download script to accompany LREC2020 paper "Automated Fact-Checking of Claims from Wikipedia"
☆13Jul 19, 2023Updated 3 years ago
phueb / Zorro
View on GitHub
Grammar test suite for masked language models
☆10Jan 1, 2023Updated 3 years ago
motazsaad / ara-pronunciation-tool
View on GitHub
A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …
☆15Sep 5, 2017Updated 8 years ago
coryshain / dnnseg
View on GitHub
☆11Mar 20, 2021Updated 5 years ago
parthpatwa / covid19-fake-news-detection
View on GitHub
Official repository for data set and baselines for covid19 fake news data.
☆66Apr 30, 2021Updated 5 years ago
talaugust / definition-complexity
View on GitHub
☆14Jun 13, 2022Updated 4 years ago
sarahjuan / iban
View on GitHub
☆14Jun 12, 2015Updated 11 years ago
freedomvote / freedomvote
View on GitHub
A tool to represent the views of politicians and parties as a help to the voters.
☆17Jul 13, 2026Updated last week
dan-wells / kiss-aligner
View on GitHub
Simple Kaldi recipe for forced alignment
☆11Jul 16, 2023Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
apepa / claim-rank
View on GitHub
☆17Aug 27, 2018Updated 7 years ago
iris2hu / diachronic-sense-modeling
View on GitHub
☆23Jun 2, 2019Updated 7 years ago
beinborn / relative_importance
View on GitHub
☆17Jun 17, 2025Updated last year
xinjli / phonepiece
View on GitHub
phone inventory library
☆17May 15, 2023Updated 3 years ago
boazbk / panbook
View on GitHub
Package for typesetting a book into PDF and HTML using pandoc and a bunch of other tools
☆16Jul 21, 2020Updated 6 years ago
rsprouse / xray_microbeam_database
View on GitHub
Annotations and scripts for use with University of Wisconsin X-Ray Microbeam Speech Production Database (1994)
☆14Oct 8, 2020Updated 5 years ago
YingtongDou / CrossFake
View on GitHub
A cross-lingual COVID-19 fake news dataset
☆14Oct 14, 2021Updated 4 years ago
EricEchemane / EEs-Visualizer-V.2
View on GitHub
🧑‍💻 EEs Visualizer is an interactive tool for visualizing Fundamental Algorithms in Sorting, Searching, and Path Finding. This involves…
☆12Aug 30, 2024Updated last year
maafiah / VXGL
View on GitHub
☆16Mar 2, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
ccoreilly / deepspeech-catala
View on GitHub
Deepspeech ASR Model for the Catalan Language
☆17Feb 15, 2021Updated 5 years ago
NTRLab / MediaSpeech
View on GitHub
☆22Jul 22, 2022Updated 4 years ago
amazon-science / controllable-readability-summarization
View on GitHub
Generating Summaries with Controllable Readability Levels (EMNLP 2023)
☆15Jul 2, 2026Updated 3 weeks ago
ljuvela / SourceFilterNeuralFormants
View on GitHub
☆21Sep 20, 2024Updated last year
MtSomeThree / constrDecoding
View on GitHub
Constrained Decoding Project
☆20Nov 10, 2023Updated 2 years ago
dustmop / rasterjs
View on GitHub
retro graphics framework
☆12Jun 15, 2024Updated 2 years ago
skinahan / DIVA_PyTorch
View on GitHub
Implementation of the DIVA model of speech acquisition and production using PyTorch
☆23Jan 18, 2023Updated 3 years ago