ga642381 / Taiwanese-Whisper
Fine-tune Whisper model for Taiwanese speech recognition
☆35Updated 2 years ago
Alternatives and similar repositories for Taiwanese-Whisper
Users who are interested in Taiwanese-Whisper are comparing it to the libraries listed below.
- Taiwanese Speech Synthesis with Tacotron2☆22Updated 3 years ago
- ☆13Updated last year
- A method that directly addresses the modality gap by aligning speech token with the corresponding text transcription during the tokenizat…☆100Updated 3 months ago
- 《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》☆77Updated 2 years ago
- Official implementation of MelHuBERT☆68Updated last year
- ASR text preprocessing utility☆21Updated last year
- This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language M…☆19Updated 2 years ago
- ☆31Updated 2 years ago
- ☆10Updated 3 years ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆40Updated 2 years ago
- Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",…☆80Updated 2 years ago
- ☆41Updated 2 years ago
- Textless (ASR-transcript free) Spoken Question Answering. The official release of NMSQA dataset and the implementation of "DUAL: Textless…☆35Updated 2 years ago
- AudioCodec-Hub is a Python library for encoding and decoding audio data, supporting various neural audio codec models☆25Updated 2 years ago
- Code for DeSTA2.5-Audio☆125Updated 4 months ago
- **Interspeech 2022** 《SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks》Speec…☆102Updated 8 months ago
- ☆22Updated 6 years ago
- Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale☆28Updated 2 years ago
- Pre-trained grapheme-to-phoneme (G2P) models☆26Updated 4 years ago
- Multiband version of HIFIGAN☆19Updated 5 years ago
- Objective metrics used in several text-to-speech (TTS) papers.☆51Updated 5 months ago
- Prosodic Speech Segmentation with Transformers☆26Updated last year
- E2E TTS using Conditional Flow Matching (Experimental*)☆71Updated 2 years ago
- Explore different ways to mix speech models (wav2vec2, HuBERT) and NLP models (BART, T5, GPT) together☆46Updated 5 months ago
- A TTS Trained on Universal Audio.☆41Updated 6 months ago
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"☆37Updated 2 years ago
- ☆22Updated last year
- Implementation of Global Style Token Tacotron in TensorFlow2☆26Updated 5 years ago
- Official GitHub repository for paper "SAKURA: On the Multi-hop Reasoning of Large Audio-Language Models Based on Speech and Audio Informa…☆19Updated 3 months ago
- Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin…☆62Updated 2 years ago