gwinterstein / CantoMapView external linksLinks
An audio and transcribed corpus of contemporary Hong Kong Cantonese
☆40Dec 30, 2020Updated 5 years ago
Alternatives and similar repositories for CantoMap
Users that are interested in CantoMap are comparing it to the libraries listed below
Sorting:
- ☆99Feb 1, 2024Updated 2 years ago
- Hong Kong Cantonese Corpus of transcribed speech (spontaneous speech, radio programmes and a monologue).☆85Nov 3, 2025Updated 3 months ago
- A frequency lexicon for Hong Kong Cantonese☆23Aug 27, 2020Updated 5 years ago
- ☆10Apr 17, 2024Updated last year
- An English-to-Cantonese machine translation model☆55Mar 26, 2025Updated 10 months ago
- Python scripts and datasets of the "Extremely Low-Resource Neural Machine Translation: A Case Study of Cantonese" project☆16Oct 28, 2022Updated 3 years ago
- cantonese-mandarin unsupervised neural translation for sw project☆28May 2, 2023Updated 2 years ago
- Transformers for Cantonese☆57Oct 24, 2020Updated 5 years ago
- Toward Multi Modality Language Model - implementation of GPT-4o/Project Astra☆16Dec 10, 2024Updated last year
- Fine-tuning Wav2Vec2.0 on Common Voice(zh-HK)☆16May 8, 2022Updated 3 years ago
- This is a repository dedicated for pre-trained acoustic models of Hong Kong Cantonese and Cantonese forced alignment.☆21Nov 14, 2024Updated last year
- BERT Tokenizer with vocabulary tailored for Cantonese☆23Oct 27, 2022Updated 3 years ago
- A curated list of resources dedicated to Natural Language Processing (NLP) of Cantonese | 粵語 NLP☆92Oct 17, 2021Updated 4 years ago
- A family of efficient speech models for multilingual phone recognition☆37Oct 23, 2025Updated 3 months ago
- ASR text preprocessing utility☆21Aug 5, 2024Updated last year
- fastlangid, the only language identification package that support cantonese (zh-yue), simplified (zh-hans) and traditional chinese (zh-ha…☆43Dec 6, 2022Updated 3 years ago
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.☆48Sep 4, 2023Updated 2 years ago
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Mar 15, 2023Updated 2 years ago
- text to speech☆10Mar 19, 2024Updated last year
- Getting confidences from any end-to-end systems☆11May 24, 2023Updated 2 years ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- Target speaker automatic speech recognition (TS-ASR)☆12Oct 14, 2023Updated 2 years ago
- ☆13Sep 25, 2024Updated last year
- 粵文語料篩選器 Cantonese text filter☆41Feb 4, 2026Updated last week
- Code for "Error-driven Fixed-Budget ASR Personalization for Accented Speakers" in ICASSP 2021☆11Jun 13, 2021Updated 4 years ago
- Code for the paper "FastAdaSP: An Efficient Multitask Inference Framework for Large Speech Language Models". @ EMNLP'24(Oral)☆12Nov 14, 2024Updated last year
- Rezonator: Dynamics of human engagement☆34Feb 2, 2026Updated last week
- kaldi cnn-tdnnf baseline☆13Aug 31, 2021Updated 4 years ago
- Spiking neural networks (SNNs) for speech classification☆12Mar 14, 2022Updated 3 years ago
- Cantonese Grapheme-to-Phoneme Converter based on GitYCC/g2pW☆15Dec 10, 2024Updated last year
- A Cantonese-English translator based on prompt engineering☆12Sep 19, 2023Updated 2 years ago
- ☆10Sep 19, 2022Updated 3 years ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆27Mar 14, 2025Updated 11 months ago
- Cantonese segmentation tool 粵語分詞工具☆30Aug 22, 2020Updated 5 years ago
- An automatic speech recognition environment for Icelandic based on Kaldi☆14Oct 12, 2017Updated 8 years ago
- CyberCan is a lexicon of contemporary Cantonese based on more than 100 million pieces of internet texts from discussion forums in Hong Ko…☆12Aug 24, 2021Updated 4 years ago
- Scrape cantonese syllables from CUHK Multi-function Chinese Character Database.☆10Mar 18, 2015Updated 10 years ago
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated 10 months ago
- The Cantonese Wordnet☆14Dec 4, 2023Updated 2 years ago