chenchenzi/HKCantonese_models

This repository is dedicated to pre-trained acoustic models of Hong Kong Cantonese and to Cantonese forced alignment.

☆29

Alternatives and similar repositories for HKCantonese_models

Users that are interested in HKCantonese_models are comparing it to the libraries listed below.

gwinterstein / CantoMap
An audio and transcribed corpus of contemporary Hong Kong Cantonese
☆41 · Dec 30, 2020 · Updated 5 years ago
MontrealCorpusTools / speechcorpustools
Easier analysis of large speech corpora
☆24 · Jun 22, 2021 · Updated 5 years ago
WangHelin1997 / Automatic_Speech_Annotator
Automatic speech annotator processing speech with voice activity detection, overlapping speech detection, speaker diarization and automat…
☆33 · Jun 14, 2024 · Updated 2 years ago
santiagobarreda / FastTrack
A Praat plugin for fast, accurate, (nearly) automatic formant-tracking
☆87 · Oct 27, 2025 · Updated 8 months ago
ymgw55 / WSMD
Improving word mover’s distance by leveraging self-attention matrix (Published in EMNLP 2023 Findings)
☆10 · Mar 10, 2026 · Updated 4 months ago
MLSpeech / DeepPhoneticToolsTutorial
Tutorial on {Deep} Phonetic Tools given in BigPhon @ LabPhon15
☆12 · Apr 17, 2017 · Updated 9 years ago
ayaka14732 / gpt4-cantonese-english-translator
A Cantonese-English translator based on prompt engineering
☆12 · Sep 19, 2023 · Updated 2 years ago
dustinfife / fifer
A collection of R functions for data manipulation, data analysis, and plotting
☆14 · Oct 29, 2020 · Updated 5 years ago
paramiai / cantoformer
Transformers for Cantonese
☆58 · Oct 24, 2020 · Updated 5 years ago
JinchaoLove / CUHK-PhD-Thesis-Template
LaTeX template for CUHK PhD Thesis
☆14 · Jun 29, 2025 · Updated last year
pariajm / english-fisher-annotations
A recipe for constituency parsing, disfluency tagging and obtaining the fluent transcripts of the English Fisher dataset
☆13 · May 2, 2021 · Updated 5 years ago
shui-dun / multimodal_ad
☆11 · Jul 14, 2023 · Updated 3 years ago
mktiede / GetContours
MATLAB tool for interactively extracting tongue contours from ultrasound movie or DICOM sequences
☆17 · Apr 30, 2021 · Updated 5 years ago
Shelton1013 / Whisper_MCE
[ICASSP'25] Developing a Multilingual Dataset and Evaluation Metrics for Code-Switching: A Focus on Hong Kong's Polylingual Dynamics
☆39 · Aug 10, 2025 · Updated 11 months ago
Annafavaro / PARKCELEB
☆11 · Jun 13, 2026 · Updated last month
croz-ltd / Rocket.Chat.App-Remind
☆13 · Jan 7, 2023 · Updated 3 years ago
rctatman / SrtToTextgrid
Python script to convert .srt subtitle files to Praat .textgrid files
☆17 · Jul 10, 2024 · Updated 2 years ago
kosuke-kitahara / xlsr-wav2vec2-phoneme-recognition
☆27 · Mar 29, 2021 · Updated 5 years ago
tjmahr / readtextgrid
Read in a 'Praat' 'TextGrid' File
☆17 · Oct 28, 2025 · Updated 8 months ago
HLTCHKUST / cantonese-asr
☆103 · Feb 1, 2024 · Updated 2 years ago
SimonGreenhill / rcldf
rcldf - The R library for reading CLDF files
☆16 · Updated this week
tobiasrordorf / SRT-to-CSV-and-audio-split
Split long audio files based on subtitle-info in SRT File (Transcript saved in CSV)
☆20 · Nov 14, 2019 · Updated 6 years ago
CanCLID / canto-filter
Cantonese text filter (粵文語料篩選器)
☆43 · Feb 4, 2026 · Updated 5 months ago
litagin02 / anime_speaker_embedding
Speaker embedding for anime speech domain based on ECAPA_TDNN
☆21 · Jun 22, 2025 · Updated last year
alin995 / speech_synthesis
Speech synthesis from scratch
☆11 · Nov 28, 2023 · Updated 2 years ago
chenchenzi / P2FA_Mandarin_py3
Modified Python3 P2FA for Mandarin
☆10 · Sep 21, 2020 · Updated 5 years ago
iamanigeeit / present
☆14 · Aug 19, 2024 · Updated last year
p0p4k / Matcha-TTS-2
E2E TTS using Conditional Flow Matching (Experimental*)
☆71 · Nov 10, 2023 · Updated 2 years ago
BirdVox / PCEN-SNR
Audio activity detector based on per-channel energy normalization (PCEN)
☆32 · Nov 16, 2018 · Updated 7 years ago
PedroEstevesPT / kaldi_toy_example
Toy example to illustrate how to use Kaldi recipes.
☆13 · Mar 11, 2021 · Updated 5 years ago
LingweiMeng / MyChatGPT
A casual and simple ChatGPT Python script that can run in a terminal (as long as you have an API). Supports the Azure API.
☆20 · May 3, 2025 · Updated last year
johnwdubois / rezonator
Rezonator: Dynamics of human engagement
☆34 · Jul 8, 2026 · Updated 2 weeks ago
evelynkyl / yue_nmt
Python scripts and datasets of the "Extremely Low-Resource Neural Machine Translation: A Case Study of Cantonese" project
☆16 · Oct 28, 2022 · Updated 3 years ago
ORI-Muchim / PolyLangVITS
Multi-speaker Speech Synthesis Using VITS (KO, JA, EN, ZH)
☆75 · Feb 28, 2024 · Updated 2 years ago
yinql1995 / Fine-grained-Multimodal-DeepFake-Classification
☆18 · Jun 21, 2024 · Updated 2 years ago
ASLP-lab / WenetSpeech-Yue
A Large-scale Cantonese Speech Corpus with Multi-dimensional Annotation
☆344 · Jun 6, 2026 · Updated last month
LeonVitanos / Wallch
Linux wallpaper changer
☆18 · Jan 23, 2026 · Updated 6 months ago
prosodylab / Prosodylab-Aligner
Python interface for forced audio alignment using HTK and SoX
☆351 · Jun 28, 2020 · Updated 6 years ago
chutaklee / CantoASR
Fine-tuning Wav2Vec2.0 on Common Voice (zh-HK)
☆16 · May 8, 2022 · Updated 4 years ago