repo for Tibetan corpora
☆24Apr 10, 2023Updated 3 years ago
Alternatives and similar repositories for Corpora
Users that are interested in Corpora are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository will soon contain all scripts and links to the annotated corpora of Tibetan.☆14Feb 4, 2025Updated last year
- ☆19Jun 20, 2017Updated 8 years ago
- ✒️ དག་བྱེད། Dakje, improving your spelling and readability☆12Jul 19, 2022Updated 3 years ago
- Lucene analyzer for Tibetan☆12Oct 23, 2025Updated 7 months ago
- ☆17Oct 8, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- simple CSV database if Tibetan verbs☆22Jul 16, 2015Updated 10 years ago
- 😎 Curated list of Tibetan NLP projects☆44Jul 15, 2020Updated 5 years ago
- 🦜 NLP for Tibetan, in Python.☆40Apr 2, 2026Updated last month
- ☆12Jun 24, 2022Updated 3 years ago
- Resources for spell checking Tibetan☆13Jul 9, 2020Updated 5 years ago
- 🏷 བོད་ཏོག [pʰøtɔk̚] Tibetan word tokenizer in Python☆80Apr 13, 2026Updated last month
- ☆63May 13, 2026Updated last week
- Visualization system for exploring analogy relationship in neural word embedding☆13Nov 11, 2018Updated 7 years ago
- phonetic transcription for Tibetan☆10Mar 13, 2019Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- CRFs based Chinese word segmentor☆21Oct 8, 2014Updated 11 years ago
- This is a collection of sentence-level aligned Sanskrit-Tibetan Etexts.☆14Jun 27, 2022Updated 3 years ago
- MNIST of Tibetan handwriting 国产手写藏文MNIST数据集(TibetanMNIST)的图像分类处理与各种好玩的脑洞~☆36Feb 2, 2019Updated 7 years ago
- A collection of popular mono fonts merged with other fonts to support more languages☆17Feb 19, 2022Updated 4 years ago
- Zotero Translators for Korean academic sites including DBpia, RISS, KCI, KISS, earticle, and Scholar.☆22May 13, 2026Updated last week
- A library to calculate dates of the Tibetan lunar calendar☆12Mar 3, 2023Updated 3 years ago
- Tibetan transliteration between EWTS and Unicode☆18May 8, 2026Updated 2 weeks ago
- Tibetan Unicode to Wylie converter. (EWTS-Extended Wylie Transliteration Scheme)☆32Apr 2, 2026Updated last month
- 基于LLaMA2-7B增量预训练的藏文大语言模型TiLamb(Tibetan Large Language Model Base)☆37Apr 3, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- TIP-LAS: An open source toolkit for Tibetan word segmentation and part-of-speech tagging☆84Nov 11, 2022Updated 3 years ago
- ☆19Jan 11, 2018Updated 8 years ago
- Code for EMNLP 2023 paper: DALE: Generative Data Augmentation for Low-Resource Legal NLP☆10Oct 27, 2023Updated 2 years ago
- Sequence alignment and textual reconstruction for Sanskrit texts☆13Updated this week
- Download Secure Monlam Bodyig Tibetan font set up for Mac OS and Windows☆20Mar 4, 2026Updated 2 months ago
- Tibetan phonetics engine in Python☆22Dec 28, 2025Updated 4 months ago
- BERT baselines for extractive question answering on coqa (https://stanfordnlp.github.io/coqa/)☆10Jan 27, 2020Updated 6 years ago
- Chinese Word Segmentation Using MM/CRF/Bi-LSTM/Bi-LSTM-CRF/BERT-LSTM, 中文分词,使用 最大词匹配、CRF(CRF++)、Bi-LSTM (+CRF)、BERT-Bi-LSTM☆30Apr 2, 2020Updated 6 years ago
- Adds all required Fedora objects to allow users to ingest and retrieve Oral Histories (video/audio) files through the Islandora interface☆12May 26, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A cute toolkit for OCR with GUI, including image preprocessing and text recognition. Works out of the box. 一只小小的OCR工具箱,包括图像预处理和文字识别等功能,…☆18Nov 6, 2025Updated 6 months ago
- Viterbi part-of-speech tagger, trained on Wall Street Journal (WSJ) data☆14Mar 6, 2018Updated 8 years ago
- A TensorFlow implementation of FlowQA☆15Nov 24, 2018Updated 7 years ago
- Hadoop-based tool for extraction of large scale synchronous grammars for paraphrasing and machine translation☆15Dec 2, 2016Updated 9 years ago
- a corpus containing 4.5K conversations from the Conversational Question-Answering dataset CoQA, for a total of 53K follow-up question-ans…☆16Jun 12, 2023Updated 2 years ago
- a simple implementation of part-of-speech tagging with hmm☆13Feb 26, 2019Updated 7 years ago
- An OCR application focused on machine-print Tibetan text.☆18Jun 29, 2018Updated 7 years ago