tibetan-nlp / awesome-tibetan-nlpLinks
😎 Curated list of Tibetan NLP projects
☆41Updated 5 years ago
Alternatives and similar repositories for awesome-tibetan-nlp
Users that are interested in awesome-tibetan-nlp are comparing it to the libraries listed below
Sorting:
- Linguistically analyzed Classical Tibetan texts☆26Updated 4 years ago
- 🏷 བོད་ཏོག [pʰøtɔk̚] Tibetan word tokenizer in Python☆69Updated 2 weeks ago
- 🦜 NLP for Tibetan, in Python.☆37Updated 2 years ago
- repo for Tibetan corpora☆21Updated 2 years ago
- Multilingual sentence alignment using sentence embeddings☆126Updated 11 months ago
- ☆18Updated 8 years ago
- Hunspell files for Tibetan☆22Updated 10 years ago
- Useful resources for Mongolian NLP☆189Updated 10 months ago
- OpusFilter - Parallel corpus processing toolkit☆110Updated 2 weeks ago
- Improved Sentence Alignment in Linear Time and Space☆184Updated 2 years ago
- Sentence aligner☆118Updated 4 years ago
- Improving Low-Resource Neural Machine Translation of Related Languages by Transfer Learning☆19Updated 2 years ago
- 🙊 software for creating speech recognition models.☆159Updated last year
- Machine-Translation-based sentence alignment tool for parallel text☆313Updated 4 years ago
- Punctuation Restoration using Transformer Models for High-and Low-Resource Languages☆221Updated last year
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆159Updated last year
- Machine Translation (MT) Evaluation Scripts☆17Updated last year
- Machine Translation (MT) Preparation Scripts☆33Updated 4 months ago
- ✒️ དག་བྱེད། Dakje, improving your spelling and readability☆11Updated 3 years ago
- ☆56Updated last week
- Machine Translation (MT) Web Interface for OpenNMT and FairSeq models using CTranslate and Streamlit☆15Updated 3 years ago
- TED parallel Corpora is growing collection of Bilingual parallel corpora, Multilingual parallel corpora and Monolingual corpora extracted…☆251Updated 9 years ago
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆283Updated this week
- ☆76Updated last month
- A tool that locates, downloads, and extracts machine translation corpora☆158Updated last month
- We use phonetics as a feature to create a joint semantic-phonetic embedding and improve the neural machine translation between Chinese an…☆12Updated 4 years ago
- Aligned bilingual word vectors for English and Chinese☆11Updated 7 years ago
- Bitextor generates translation memories from multilingual websites☆296Updated 11 months ago
- Complimentary code for our paper Automatic punctuation restoration with BERT models☆50Updated last year
- cLang-8 is a dataset for grammatical error correction.☆109Updated 3 years ago