TUFS Asian Language Parallel Corpus
☆52 · May 1, 2023 · Updated 2 years ago
Alternatives and similar repositories for TALPCo
Users who are interested in TALPCo compare it to the repositories listed below.
- Morphological dictionary for Malay/Indonesian ☆17 · Nov 23, 2024 · Updated last year
- Java library to tokenize Thai text into a list of TCCs ☆19 · May 30, 2017 · Updated 8 years ago
- A public repository for corrupt0 datathon's court data ☆11 · Jul 6, 2019 · Updated 6 years ago
- Parallel Universal Dependencies. ☆15 · Nov 12, 2025 · Updated 3 months ago
- Yaitron English-Thai and Thai-English dictionary ☆34 · Oct 13, 2020 · Updated 5 years ago
- English - Indonesian parallel corpora ☆17 · Aug 6, 2018 · Updated 7 years ago
- Unsupervised parallel sentence extraction from comparable corpora ☆16 · Aug 6, 2019 · Updated 6 years ago
- Indonesian-English Bilingual Corpus ☆18 · Jul 16, 2012 · Updated 13 years ago
- Myanmar and Thai Language Resources ☆10 · Jul 18, 2022 · Updated 3 years ago
- The English-Vietnamese Bilingual Corpus (EVBCorpus) is a collection of English and Vietnamese parallel translations and bitexts. ☆49 · Jul 12, 2019 · Updated 6 years ago
- A comprehensive evaluation framework for the SEA region ☆19 · Feb 16, 2026 · Updated 2 weeks ago
- Thai Grapheme to Phoneme (G2P) Wiktionary Corpus ☆13 · Jul 25, 2022 · Updated 3 years ago
- Official code and data of "3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset" ☆12 · Dec 8, 2024 · Updated last year
- Dataset of Burmese proverbs ☆11 · Jun 26, 2017 · Updated 8 years ago
- Shan Natural Language Processing tools inspired by PyThaiNLP ☆14 · Updated this week
- Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation ☆15 · Aug 27, 2024 · Updated last year
- A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammars ☆17 · Jun 14, 2024 · Updated last year
- Pretraining scripts for BART transformer model ☆12 · May 15, 2023 · Updated 2 years ago
- A Dataset for Thai Text Summarization with over 310K articles. ☆29 · Feb 4, 2023 · Updated 3 years ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies) ☆34 · Jun 29, 2025 · Updated 8 months ago
- SnapLogic Snap Recommendation Workshop with Decision Trees and Deep Learning ☆14 · Jun 5, 2019 · Updated 6 years ago
- Awesome Lao Natural Language Processing ☆16 · Mar 7, 2025 · Updated 11 months ago
- Curated list of publicly available parallel corpus for Indian Languages ☆36 · Jul 15, 2021 · Updated 4 years ago
- PyThaiNLP For spaCy ☆16 · Feb 5, 2026 · Updated last month
- Python package for unsupervised text segmentation.