sandy1990418 / ChineseTaiwaneseWhisperLinks
This repository focuses on leveraging OpenAI's Whisper model for speech recognition in Chinese (Mandarin) and Taiwanese Hokkien languages. It includes tools and scripts for data preprocessing, model training, and evaluation, tailored to improve speech recognition accuracy for these languages.
☆48Updated 5 months ago
Alternatives and similar repositories for ChineseTaiwaneseWhisper
Users that are interested in ChineseTaiwaneseWhisper are comparing it to the libraries listed below
Sorting:
- A method that directly addresses the modality gap by aligning speech token with the corresponding text transcription during the tokenizat…☆84Updated last month
- Taiwanese Speech Synthesis with Tacotron2☆21Updated 2 years ago
- fine-tune Whipser model for Taiwanese speech recognition☆32Updated 2 years ago
- **Interspeech 2022** 《SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks》Speec…☆102Updated 4 months ago
- 《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》☆74Updated 2 years ago
- VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems.☆39Updated last year
- Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin…☆56Updated 2 years ago
- PyTorch toolkit for streaming speech recognition, speech translation and simultaneous translation based on fairseq.☆25Updated 2 years ago
- Zero-Shot Emotion Style Transfer☆49Updated 3 months ago
- Code for DeSTA2.5-Audio☆99Updated this week
- Toolbox for easy and qualitative one-shot voice conversion☆45Updated 3 years ago
- Official implementation of MelHuBERT☆66Updated 9 months ago
- Phoneme segmentation using pre-trained speech models☆55Updated 2 years ago
- multilingual speech aligner☆75Updated last year
- ☆13Updated 10 months ago
- ☆38Updated 4 years ago
- 56 language, 1 model Multilingual ASR☆25Updated 4 years ago
- ASR text preprocessing utility☆21Updated last year
- Official release of StyleTalk dataset.☆67Updated last year
- This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language M…☆19Updated 2 years ago
- Barkify: an unoffical training implementation of Bark TTS by suno-ai☆127Updated 2 years ago
- Toward Multi Modality Language Model - implementation of GPT-4o/Project Astra☆16Updated 8 months ago
- ☆21Updated last year
- ☆41Updated 2 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆54Updated 2 years ago
- ☆68Updated 10 months ago
- Code and model for ICASSP 2025 Paper "Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data"☆106Updated 3 weeks ago
- Monotonic Alignment Search☆96Updated 2 months ago
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.☆44Updated 3 years ago
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆99Updated last year