An English-to-Cantonese machine translation model
☆55Mar 26, 2025Updated last year
Alternatives and similar repositories for TransCan
Users that are interested in TransCan are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Cantonese-English translator based on prompt engineering☆12Sep 19, 2023Updated 2 years ago
- Transformers for Cantonese☆58Oct 24, 2020Updated 5 years ago
- A curated list of resources dedicated to Natural Language Processing (NLP) of Cantonese | 粵語 NLP☆94Oct 17, 2021Updated 4 years ago
- BERT Tokenizer with vocabulary tailored for Cantonese☆23Oct 27, 2022Updated 3 years ago
- A Python script for scraping LIHKG☆32Mar 7, 2022Updated 4 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- 開放粵語字典 - 現代粵語字音數據庫☆66Mar 30, 2023Updated 3 years ago
- 《香港二十世紀中期粵語語料庫》打包器☆16Apr 12, 2016Updated 10 years ago
- 粵文語料篩選器 Cantonese text filter☆42Feb 4, 2026Updated 2 months ago
- CyberCan is a lexicon of contemporary Cantonese based on more than 100 million pieces of internet texts from discussion forums in Hong Ko…☆12Aug 24, 2021Updated 4 years ago
- Python scripts and datasets of the "Extremely Low-Resource Neural Machine Translation: A Case Study of Cantonese" project☆16Oct 28, 2022Updated 3 years ago
- ☆40Updated this week
- Dictionaries for StarCC, the next generation of Simplified-Traditional Chinese conversion framework☆12Jun 20, 2022Updated 3 years ago
- Cantonese Linguistics and NLP☆404Apr 14, 2026Updated 2 weeks ago
- cantonese-mandarin unsupervised neural translation for sw project☆29May 2, 2023Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- 粵語對話語料☆30May 12, 2023Updated 2 years ago
- Fine-tuning Wav2Vec2.0 on Common Voice(zh-HK)☆16May 8, 2022Updated 3 years ago
- Cantoboard - Smart Cantonese Keyboard on iOS☆72Oct 15, 2023Updated 2 years ago
- This is a repository dedicated for pre-trained acoustic models of Hong Kong Cantonese and Cantonese forced alignment.☆27Nov 14, 2024Updated last year
- 電腦用漢字粵語拼音表 / Cantonese Pronunciation List of the Characters for Computers☆63Jan 11, 2024Updated 2 years ago
- Cantonese segmentation tool 粵語分詞工具☆31Aug 22, 2020Updated 5 years ago
- A Github Action which tags the repo, intended to be used only when on the main branch of your repo at the end of an Nx powered CI run☆13Jul 8, 2021Updated 4 years ago
- ☆13Jul 11, 2018Updated 7 years ago
- TypeDuck: Cantonese for everyone at your fingertips☆23Nov 17, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Dataset for analysing Propagation of COVID-19 Misinformation on Twitter☆18Jan 31, 2024Updated 2 years ago
- 汉语方言字 https://fangyanzi.vercel.app☆24Nov 1, 2022Updated 3 years ago
- Open Chinese Convert(OpenCC, 開放中文轉換) binding for the Rust language for conversion between Traditional Chinese and Simplified Chinese.☆37Nov 24, 2025Updated 5 months ago
- Cantonese Video Transcribe Service☆25Jul 25, 2025Updated 9 months ago
- JAX implementation of the bart-base model☆34Apr 11, 2023Updated 3 years ago
- ☆10Apr 17, 2024Updated 2 years ago
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.☆48Sep 4, 2023Updated 2 years ago
- 仙人掌明體 Cactus Classical Serif☆34Aug 29, 2025Updated 8 months ago
- JAX implementation of LLaMA, aiming to train LLaMA on Google Cloud TPU☆14Jul 22, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 将声音按照每一句话进行切割☆16Sep 25, 2019Updated 6 years ago
- Scrape cantonese syllables from CUHK Multi-function Chinese Character Database.☆11Mar 18, 2015Updated 11 years ago
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆36Mar 23, 2026Updated last month
- ☆103Feb 1, 2024Updated 2 years ago
- ☆21Apr 21, 2022Updated 4 years ago
- Hong Kong Cantonese Corpus of transcribed speech (spontaneous speech, radio programmes and a monologue).☆87Nov 3, 2025Updated 6 months ago
- 朱古力黑體 Chocolate Classical Sans☆44Aug 29, 2025Updated 8 months ago