sandy1990418 / ChineseTaiwaneseWhisperLinks
This repository focuses on leveraging OpenAI's Whisper model for speech recognition in Chinese (Mandarin) and Taiwanese Hokkien languages. It includes tools and scripts for data preprocessing, model training, and evaluation, tailored to improve speech recognition accuracy for these languages.
☆54Updated 7 months ago
Alternatives and similar repositories for ChineseTaiwaneseWhisper
Users that are interested in ChineseTaiwaneseWhisper are comparing it to the libraries listed below
Sorting:
- A method that directly addresses the modality gap by aligning speech token with the corresponding text transcription during the tokenizat…☆90Updated last month
- Taiwanese Speech Synthesis with Tacotron2☆22Updated 3 years ago
- **Interspeech 2022** 《SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks》Speec…☆102Updated 5 months ago
- ☆92Updated last year
- 56 language, 1 model Multilingual ASR☆25Updated 4 years ago
- fine-tune Whipser model for Taiwanese speech recognition☆33Updated 2 years ago
- ☆10Updated 3 years ago
- Textless (ASR-transcript free) Spoken Question Answering. The official release of NMSQA dataset and the implementation of "DUAL: Textless…☆35Updated 2 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆53Updated 2 years ago
- Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin…☆57Updated 2 years ago
- ☆13Updated last year
- A PyTorch implementation of the universal neural vocoder☆67Updated 4 years ago
- TransferTTS (Zero-Shot learning of VITS)☆102Updated 3 years ago
- Toolbox for easy and qualitative one-shot voice conversion☆46Updated 3 years ago
- PyTorch toolkit for streaming speech recognition, speech translation and simultaneous translation based on fairseq.☆25Updated 3 years ago
- Official implementation of MelHuBERT☆67Updated 11 months ago
- ASR text preprocessing utility☆21Updated last year
- ☆25Updated 3 years ago
- Code and model for ICASSP 2025 Paper "Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data"☆113Updated 2 months ago
- Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together☆47Updated 3 months ago
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.☆46Updated 3 years ago
- Toward Multi Modality Language Model - implementation of GPT-4o/Project Astra☆16Updated 9 months ago
- VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems.☆39Updated 2 years ago
- This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language M…☆19Updated 2 years ago
- ☆87Updated 2 months ago
- multilingual speech aligner☆77Updated last year
- ☆38Updated 4 years ago
- Code:Completely Unsupervised Speech Recognition By A Generative Adversarial Network Harmonized With Iteratively Refined Hidden Markov Mod…☆25Updated 5 years ago
- one script for xls-r/xlsr/whisper fine-tuning☆42Updated 2 years ago
- Official release of StyleTalk dataset.☆69Updated last year