TUFS Asian Language Parallel Corpus
☆53May 1, 2023Updated 3 years ago
Alternatives and similar repositories for TALPCo
Users that are interested in TALPCo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- English - Indonesian parallel corpora☆17Aug 6, 2018Updated 7 years ago
- CRF syllable segmenter for Thai☆27May 3, 2024Updated 2 years ago
- A public repository for corrupt0 datathon's court data☆11Jul 6, 2019Updated 6 years ago
- The English-Vietnamese Bilingual Corpus (EVBCorpus) is a collection of English and Vietnamese parallel translations and bitexts.☆52Jul 12, 2019Updated 6 years ago
- Parallel Universal Dependencies.☆15May 6, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Yaitron English-Thai and Thai-English dictionary☆34Oct 13, 2020Updated 5 years ago
- ☆18Oct 6, 2022Updated 3 years ago
- basically all words, in a compressed form☆17Jan 9, 2023Updated 3 years ago
- Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation☆15Aug 27, 2024Updated last year
- The synonym for thai (open source & open data)☆18Dec 6, 2023Updated 2 years ago
- Official code and data of "3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset"☆12Dec 8, 2024Updated last year
- Myanmar and Thai Language Resources☆10Jul 18, 2022Updated 3 years ago
- A tool to make spelling Thai more convenient☆11Mar 30, 2024Updated 2 years ago
- We gather Malaysian dataset! https://malaysian-dataset.readthedocs.io/☆339Jan 7, 2026Updated 5 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Adaptive Machine Translation with Large Language Models☆31Jan 4, 2025Updated last year
- Source code for the NAACL 2021 paper: Pruning-then-Expanding Model for Domain Adaptation of Neural Machine Translation☆15Jul 19, 2021Updated 4 years ago
- 🐍🍑 Python 3 library for managing, annotating, and converting natural language corpuses using popular formats (CoNLL, ELAN, Praat, CSV, …☆21Jun 26, 2024Updated last year
- Tiny language detector + tokenizer for 50+ SE/South Asian languages (Burmese, Karen, Chin, Shan, Mon, Khmer, Lao, Thai, Tamil, Hindi, ……☆38May 24, 2026Updated 3 weeks ago
- ☆12Dec 14, 2020Updated 5 years ago
- A Dataset for Thai Text Summarization with over 310K articles.☆30Feb 4, 2023Updated 3 years ago
- Thai Spelling Check☆42Apr 2, 2023Updated 3 years ago
- A multi-language segmenter using high-order CRF.☆17Feb 27, 2020Updated 6 years ago
- Thai Grapheme to Phoneme (G2P) Wiktionary Corpus☆13Jul 25, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆14Dec 23, 2024Updated last year
- Indonesian Manually Tagged Corpus☆90Jul 5, 2022Updated 3 years ago
- SCT: An Efficient Self-Supervised Cross-View Training For Sentence Embedding (TACL)☆16Jul 27, 2024Updated last year
- JUMAN++とKNPをDockerで使えるようにする。☆17Jan 19, 2019Updated 7 years ago
- Parallel corpora for the biomedical domain☆51Mar 27, 2026Updated 2 months ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆36Jun 29, 2025Updated 11 months ago
- Pretraining scripts for BART transformer model☆12May 15, 2023Updated 3 years ago
- ☆17Dec 12, 2024Updated last year
- python package for unsupervised text segmentation.☆14Oct 31, 2016Updated 9 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆40Feb 1, 2023Updated 3 years ago
- Curated list of publicly available parallel corpus for Indian Languages☆37Jul 15, 2021Updated 4 years ago
- Drawing tree structures with SVG and JavaScript☆34Aug 2, 2015Updated 10 years ago
- End-to-end integration of HuggingFace's models for sequence labeling.☆11Oct 4, 2020Updated 5 years ago
- A comprehensive evaluation framework for the SEA region☆29Apr 20, 2026Updated last month
- Shan Natural Language Processing tools inspired by PythaiNLP☆14Mar 1, 2026Updated 3 months ago
- Tools for extracting parallel corpora from article titles across languages in Wikipedia☆74Feb 25, 2015Updated 11 years ago