ayaka14732 / bert-tokenizer-cantonese
BERT Tokenizer with vocabulary tailored for Cantonese
☆20Updated 2 years ago
Alternatives and similar repositories for bert-tokenizer-cantonese:
Users that are interested in bert-tokenizer-cantonese are comparing it to the libraries listed below
- cantonese-mandarin unsupervised neural translation for sw project☆25Updated last year
- An English-to-Cantonese machine translation model☆49Updated 10 months ago
- Transformers for Cantonese☆56Updated 4 years ago
- An audio and transcribed corpus of contemporary Hong Kong Cantonese☆35Updated 4 years ago
- Aligner for text-to-speech☆14Updated 6 months ago
- Chinese Mandarin Synthesis Corpus-Female/Emotional☆10Updated 6 months ago
- Taiwanese Speech Synthesis with Tacotron2☆19Updated 2 years ago
- ROUGE for multilingual Summarization☆23Updated 3 years ago
- Unsupervised spoken sentence embeddings☆14Updated 2 years ago
- Revisiting End-to-End Speech-to-Text Translation From Scratch☆12Updated last year
- A frequency lexicon for Hong Kong Cantonese☆21Updated 4 years ago
- 粵語拼音自動標註工具 Cantonese Pronunciation Automatic Labeling Tool☆65Updated 4 months ago
- 粵文語料篩選器 Cantonese text filter☆37Updated this week
- Python scripts and datasets of the "Extremely Low-Resource Neural Machine Translation: A Case Study of Cantonese" project☆14Updated 2 years ago
- Phonemes and durations labeling based on whisper small☆11Updated 7 months ago
- Grapheme-to-Phoneme lexicons for Chinese dialects☆67Updated 2 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆28Updated 9 months ago
- Cantonese segmentation tool 粵語分詞工具☆29Updated 4 years ago
- CML-TTS: A Multilingual Dataset for Speech Synthesis☆29Updated 6 months ago
- Production-ready vocoder using BigVSAN☆11Updated 11 months ago
- This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language M…☆17Updated 2 years ago
- Chinese polyphone disambiguation for Text-to-Speech application☆30Updated 8 months ago
- ☆13Updated 4 months ago
- ☆12Updated last year
- Whisper_MCE☆20Updated 7 months ago
- Non Parallel Voice Conversion based on VITS☆24Updated last year
- Tools for convert Text to IPA in python☆18Updated 2 years ago
- A handy dataset of noises for ASR☆19Updated 5 years ago
- Supervoice Speaker Separation Network☆12Updated 8 months ago
- The case study and multilingfual performance of ICASSP submission☆20Updated 2 years ago