chenchenzi / HKCantonese_modelsLinks
This is a repository dedicated for pre-trained acoustic models of Hong Kong Cantonese and Cantonese forced alignment.
☆20Updated last year
Alternatives and similar repositories for HKCantonese_models
Users that are interested in HKCantonese_models are comparing it to the libraries listed below
Sorting:
- An audio and transcribed corpus of contemporary Hong Kong Cantonese☆40Updated 5 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆36Updated last year
- cantonese-mandarin unsupervised neural translation for sw project☆28Updated 2 years ago
- Code for ACL 2024 findings paper "wav2vec-S: Adapting Pre-trained Speech Models for Streaming"☆10Updated 9 months ago
- ☆58Updated last year
- Fine-tuning Wav2Vec2.0 on Common Voice(zh-HK)☆16Updated 3 years ago
- ☆14Updated last year
- Official Code for ParrotTTS☆58Updated last year
- ☆14Updated 6 months ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆34Updated 2 years ago
- UTAUTAI(Unrestricted Tune Automated Technology Artificial Interigence)☆14Updated 2 years ago
- ☆20Updated 9 months ago
- Official release of StyleTalk dataset.☆70Updated last year
- asr2k☆52Updated last year
- Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?".☆25Updated last year
- An implementation of Charactr, Inc's "WavThruVec: Latent speech representation as intermediate features for neural speech synthesis"☆29Updated 2 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Updated last year
- ☆10Updated last year
- ☆15Updated last year
- A collection of all our phonemeizers for dataset construction and inference☆27Updated 11 months ago
- The offical code of "Parameter-Efficient Learning for Text-to-Speech Accent Adaptation"☆13Updated 2 years ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆27Updated 10 months ago
- Unsupervised spoken sentence embeddings☆14Updated 3 years ago
- ☆14Updated 2 years ago
- Collection of scripts from mHuBERT-147.☆32Updated last year
- pytorch model for contexless-phoneme prediction from speech audio☆30Updated 2 months ago
- Putting flows on top of neural transducers for better TTS☆64Updated last week
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models☆25Updated last year
- Phoneme segmentation using pre-trained speech models☆55Updated 3 years ago
- Workflow for forced alignment between languages☆23Updated 2 weeks ago