andreihar / taibun
Taiwanese Hokkien Transliterator and Tokeniser
☆21Updated 2 weeks ago
Related projects: ⓘ
- An audio and transcribed corpus of contemporary Hong Kong Cantonese☆34Updated 3 years ago
- one script for xls-r/xlsr/whisper fine-tuning☆37Updated last year
- ☆72Updated 7 months ago
- Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together☆42Updated last year
- Transformers for Cantonese☆54Updated 3 years ago
- Hong Kong Cantonese Corpus of transcribed speech (spontaneous speech, radio programmes and a monologue).☆39Updated 6 months ago
- fine-tune Whipser model for Taiwanese speech recognition☆25Updated last year
- 粵文語料篩選器 Cantonese text filter☆33Updated 2 weeks ago
- ☆26Updated 3 months ago
- Workflow for forced alignment between languages☆17Updated 7 months ago
- Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA)☆78Updated 9 months ago
- 台語、族語、客語的語料清單、彙整☆37Updated 4 years ago
- Universal multilingual automatic speech transcription into IPA☆51Updated 3 weeks ago
- Read, write, and manipulate Praat TextGrid files with Python☆123Updated 9 months ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆27Updated 4 months ago
- Code for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5☆19Updated last year
- Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin…☆41Updated last year
- Dictionary of pairs of Korean word and IPA crawled from Wiktionary (Korean edition)☆18Updated last year
- Taiwanese Speech Synthesis with Tacotron2☆18Updated last year
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆26Updated last year
- 使用 pinyin-data 和 phrase-pinyin-data 中的拼音数据文件覆盖 pypinyin 中的内置拼音数据☆40Updated 6 months ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- An English-to-Cantonese machine translation model☆48Updated 5 months ago
- ☆39Updated this week
- ASR text preprocessing utility☆20Updated last month
- 粵語拼音自動標註工具 Cantonese Pronunciation Automatic Labeling Tool☆57Updated 3 weeks ago
- An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss".☆33Updated 3 years ago
- Prosodic Speech Segmentation with Transformers☆22Updated 6 months ago
- Tools for convert Text to IPA in python☆16Updated last year
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆127Updated this week