gentaiscool / indonesian-nlp
A curated list of research papers and resources on Indonesian languages
☆40 · Mar 21, 2024 · Updated last year
Alternatives and similar repositories for indonesian-nlp
Users who are interested in indonesian-nlp are comparing it to the repositories listed below.
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances ☆50 · Sep 16, 2024 · Updated last year
- Welcome to our repository! This repository hosts the data on "IndoCollex: A Testbed for Morphological Transformation of Indonesian Word … ☆23 · Aug 10, 2021 · Updated 4 years ago
- ☆21 · Nov 30, 2019 · Updated 6 years ago
- Python implementation of CTC beam search decoder + agnostic LM scorer ☆20 · Dec 16, 2020 · Updated 5 years ago
- ☆20 · Apr 5, 2021 · Updated 4 years ago
- A tool to collect/validate audio recordings from workers on Amazon Mechanical Turk. Written in Python/Flask. (originally hosted on github… ☆14 · Dec 19, 2022 · Updated 3 years ago
- Indonesian stemmer - JavaScript library for extracting root words from affixed words in Indonesian. ☆39 · Feb 17, 2021 · Updated 4 years ago
- Quora Paraphrasing Dataset Bahasa Indonesia Version ☆11 · Apr 18, 2021 · Updated 4 years ago
- ☆10 · Mar 20, 2021 · Updated 4 years ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד) ☆11 · May 4, 2022 · Updated 3 years ago
- VoxAngeles Corpus ☆13 · Aug 23, 2025 · Updated 5 months ago
- Sequence to sequence model for Arabic punctuation prediction. ☆12 · Feb 13, 2020 · Updated 6 years ago
- Thai Grapheme to Phoneme (G2P) Wiktionary Corpus ☆13 · Jul 25, 2022 · Updated 3 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni Hindi ASR challenge ☆15 · Mar 26, 2022 · Updated 3 years ago
- Simple Kaldi recipe for forced alignment ☆11 · Jul 16, 2023 · Updated 2 years ago
- A collection of various NLP datasets, mainly Indonesia-related languages. ☆15 · Apr 23, 2022 · Updated 3 years ago
- Annotations and scripts for use with University of Wisconsin X-Ray Microbeam Speech Production Database (1994) ☆13 · Oct 8, 2020 · Updated 5 years ago
- A dataset for Indonesian Named Entity Recognizer ☆30 · Dec 10, 2020 · Updated 5 years ago
- Historical changes resulting from amendments to the 1945 Constitution of the Republic of Indonesia (Undang-Undang Dasar Negara Republik Indonesia Tahun 1945). ☆30 · Nov 3, 2019 · Updated 6 years ago
- Simple app to estimate Indonesian severance pay. This is not legal advice. ☆12 · Aug 16, 2023 · Updated 2 years ago
- Scripts to create speech corpora from open.bible ☆13 · Jan 3, 2022 · Updated 4 years ago
- Deepspeech ASR Model for the Catalan Language ☆17 · Feb 15, 2021 · Updated 5 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment ☆16 · Apr 13, 2022 · Updated 3 years ago
- Austronesian Comparative Dictionary ☆16 · Jan 7, 2025 · Updated last year
- A Python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based … ☆16 · Sep 5, 2017 · Updated 8 years ago
- A basic guide to the Swift programming language, translated into Indonesian from https://docs.swift.org. ☆11 · Jun 14, 2021 · Updated 4 years ago
- A list of Indonesian NLP resources. ☆289 · Jan 18, 2022 · Updated 4 years ago
- phone inventory library ☆17 · May 15, 2023 · Updated 2 years ago
- SemEval 2019 Task 4: Hyperpartisan News Detection ☆13 · Nov 9, 2019 · Updated 6 years ago
- ☆19 · Sep 20, 2024 · Updated last year
- A lightweight Python library for running TTS models with a unified API. ☆21 · Feb 18, 2025 · Updated 11 months ago