Spoken Cantonese from Hong Kong.
☆30May 6, 2026Updated 3 weeks ago
Alternatives and similar repositories for UD_Cantonese-HK
Users that are interested in UD_Cantonese-HK are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Spoken mandarin Chinese from Hong Kong.☆13May 6, 2026Updated 3 weeks ago
- A frequency lexicon for Hong Kong Cantonese☆25Aug 27, 2020Updated 5 years ago
- A curated list of resources dedicated to Natural Language Processing (NLP) of Cantonese | 粵語 NLP☆94Oct 17, 2021Updated 4 years ago
- Transformers for Cantonese☆58Oct 24, 2020Updated 5 years ago
- Dataset containing Semantic Relations and Metadata, for Training and Evaluating Distributional Semantic Models in English and Mandarin Ch…☆16Aug 7, 2017Updated 8 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- This is an Object Oriented implementation of a Trie in python. The class contains setter and getter methods, and implements several usefu…☆15Jan 9, 2018Updated 8 years ago
- Cantonese Linguistics and NLP☆406Updated this week
- Parallel Universal Dependencies.☆13May 21, 2026Updated last week
- A Package for Cantonese Tokenisation☆18Jun 17, 2021Updated 4 years ago
- 粵文語料篩選器 Cantonese text filter☆43Feb 4, 2026Updated 3 months ago
- 電腦用漢字粵語拼音表 / Cantonese Pronunciation List of the Characters for Computers☆64Jan 11, 2024Updated 2 years ago
- Jupyter notebooks for course "Computational Morphology with HFST".☆21Oct 5, 2022Updated 3 years ago
- CoNLL-U format library for Python☆15Apr 7, 2015Updated 11 years ago
- The Cantonese Wordnet☆14Dec 4, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Website for the association of linguistics students at the University of Tübingen☆13Updated this week
- ☆102Feb 1, 2024Updated 2 years ago
- Apertium linguistic data for Kyrgyz☆17Apr 29, 2026Updated last month
- ☆31Jun 2, 2018Updated 7 years ago
- ☆10Apr 17, 2024Updated 2 years ago
- Nanyang Technological University - Multilingual Corpus (STB subcorpora)☆12Mar 11, 2019Updated 7 years ago
- ☆12Sep 14, 2021Updated 4 years ago
- English data☆226May 20, 2026Updated last week
- Simple Python library for doing (multiple) sequence alignment☆16Jun 24, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Scrape cantonese syllables from CUHK Multi-function Chinese Character Database.☆11Mar 18, 2015Updated 11 years ago
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆36Apr 29, 2026Updated last month
- ☆11May 25, 2023Updated 3 years ago
- Interspeech 2019 tutorial materials☆49Sep 26, 2019Updated 6 years ago
- CyberCan is a lexicon of contemporary Cantonese based on more than 100 million pieces of internet texts from discussion forums in Hong Ko…☆12Aug 24, 2021Updated 4 years ago
- Phraseg - 一言:新詞發現工具包☆26Nov 30, 2021Updated 4 years ago
- Toward Multi Modality Language Model - implementation of GPT-4o/Project Astra☆16Dec 10, 2024Updated last year
- Python scripts and datasets of the "Extremely Low-Resource Neural Machine Translation: A Case Study of Cantonese" project☆16Oct 28, 2022Updated 3 years ago
- LAReQA is a challenging benchmark for evaluating language agnostic answer retrieval from a multilingual candidate pool. This repository c…☆14May 19, 2020Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- The official implementation of EMNLP 2021 paper "#HowYouTagTweets: Learning User Hashtagging Preferences via Personalized Topic Attention…☆11Feb 21, 2023Updated 3 years ago
- Attentional Neural Network that translates text to phones.☆11Jan 25, 2018Updated 8 years ago
- 《香港二十世紀中期粵語語料庫》打包器☆16Apr 12, 2016Updated 10 years ago
- Upcoming ACL 2020 paper☆26May 8, 2020Updated 6 years ago
- Fine-tuning Wav2Vec2.0 on Common Voice(zh-HK)☆16May 8, 2022Updated 4 years ago
- Kyrgyz language processing software, models and datasets.☆33May 8, 2026Updated 3 weeks ago
- Support material and source code for the model described in : "A Recurrent Encoder-Decoder Approach With Skip-Filtering Connections For M…☆13Sep 19, 2017Updated 8 years ago