sandy1990418 / ChineseTaiwaneseWhisper
This repository focuses on leveraging OpenAI's Whisper model for speech recognition in Chinese (Mandarin) and Taiwanese Hokkien languages. It includes tools and scripts for data preprocessing, model training, and evaluation, tailored to improve speech recognition accuracy for these languages.
☆35Updated last month
Alternatives and similar repositories for ChineseTaiwaneseWhisper:
Users who are interested in ChineseTaiwaneseWhisper are comparing it to the repositories listed below
- Fine-tune the Whisper model for Taiwanese speech recognition☆29Updated 2 years ago
- Taiwanese Speech Synthesis with Tacotron2☆19Updated 2 years ago
- ☆13Updated 6 months ago
- ASR text preprocessing utility☆21Updated 7 months ago
- Toward Multi Modality Language Model - implementation of GPT-4o/Project Astra☆14Updated 3 months ago
- Textless (ASR-transcript free) Spoken Question Answering. The official release of NMSQA dataset and the implementation of "DUAL: Textless…☆35Updated last year
- A Study of Low-Resource Speech Commands Recognition Based on Adversarial Reprogramming☆18Updated last year
- Official implementation of MelHuBERT☆65Updated 5 months ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆50Updated 2 years ago
- Code and model for ICASSP 2025 Paper "Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data"☆72Updated last month
- ☆10Updated 2 years ago
- **Interspeech 2022** "SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks" Speec…☆99Updated last year
- Pre-trained Wav2vec2.0 for Mandarin☆39Updated 2 years ago
- ☆31Updated last year
- Code for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5☆19Updated 2 years ago
- one script for xls-r/xlsr/whisper fine-tuning☆41Updated last year
- Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin…☆50Updated last year
- ☆41Updated last year
- This repo is related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH 2024☆19Updated 4 months ago
- ☆22Updated 9 months ago
- ☆21Updated 7 months ago
- AudioCodec-Hub is a Python library for encoding and decoding audio data, supporting various neural audio codec models☆22Updated last year
- ☆10Updated 4 months ago
- ☆30Updated 4 months ago
- A fast parallel PyTorch implementation of the "CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition" https://arxiv.org/ab…☆33Updated last year
- This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language M…☆17Updated 2 years ago
- [TASLP 2024] Textless Unit-to-Unit training for Many-to-Many Multilingual Speech-to-Speech Translation☆29Updated 6 months ago
- ☆11Updated last year
- An implementation of Charactr, Inc's "WavThruVec: Latent speech representation as intermediate features for neural speech synthesis"☆28Updated last year
- Repository for Accent Recognition (Hackathon @SLT2022)☆27Updated 10 months ago