phineas-pta / fine-tune-whisper-viLinks

jupyter notebooks to fine tune whisper models on Vietnamese using Colab and/or Kaggle and/or AWS EC2

☆19

Alternatives and similar repositories for fine-tune-whisper-vi

Users that are interested in fine-tune-whisper-vi are comparing it to the libraries listed below

Sorting:

khanld / chunkformer
ChunkFormer: Masked Chunking Conformer For Long-Form Speech Transcription
☆68Updated last week
anhnh2002 / XTTSv2-Finetuning-for-New-Languages
☆177Updated 11 months ago
tuanh123789 / Spark-TTS-finetune
finetune llm part for spark-tts model
☆110Updated 7 months ago
HKAB / whisper-finetune-vietnamese
Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM
☆38Updated 2 years ago
tuanh123789 / Train_Hifigan_XTTS
This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.
☆85Updated last year
thanhpv2102 / Vietnam-Celeb.Interspeech
Official repo for the Vietnam-Celeb dataset
☆23Updated 2 years ago
phineas-pta / speech-synthesis-ngngngan
python script to download & process data to train a speech-synthesis model of Vietnamese M.C. Nguyễn Ngọc Ngạn
☆14Updated last year
v-nhandt21 / Viphoneme
Vi_G2P or ViG2P: G2P package for Vietnamese: based on vPhon and phonology knowledge to convert Raw text - Graphoneme to IPA
☆99Updated last year
ducanhdt / openai_whisper_finetuning
☆49Updated 2 years ago
v-nhandt21 / Vinorm
Python - NSW package for Vietnamese: Normalization system to convert numbers, abbreviations, and words that cannot be pronounced into syl…
☆66Updated 10 months ago
ducnt18121997 / Viet-Text-Normalization
A Python library for text normalization, specifically designed for Vietnamese and English text processing. This library provides comprehe…
☆12Updated 7 months ago
ylacombe / finetune-hf-vits
Finetune VITS and MMS using HuggingFace's tools
☆177Updated last year
khanld / ASR-Wav2vec-Finetune
Finetune Wa2vec 2.0 For Speech Recognition
☆142Updated 9 months ago
tuanio / noisy-student-training-asr
Pytorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problem
☆97Updated 5 months ago
heraclex12 / vietpunc
Vietnamese Punctuation Prediction using Pretrained Language Models
☆14Updated 3 years ago
VinAIResearch / PhoST
A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)
☆22Updated 5 months ago
halsay / ASR-TTS-paper-daily
Update ASR paper everyday
☆375Updated this week
janhq / WhisperSpeech
Ichigo Whisper is a compact (22M parameters), open-source speech tokenizer for the Whisper-medium, designed to enhance performance on mul…
☆17Updated 10 months ago
v-nhandt21 / ViSV2TTS
Vietnamese Voice Cloning System using Speaker Verification training on multispeaker VITS
☆57Updated last year
MatthewCYM / VoiceBench
VoiceBench: Benchmarking LLM-Based Voice Assistants
☆300Updated 3 months ago
nguyenvulebinh / ViStreamASR
ViStreamASR - Real-Time Vietnamese Speech Recognition
☆47Updated 4 months ago
Vietnam-Celeb / Vietnam-Celeb
☆11Updated 2 years ago
mbzuai-nlp / ArTST
☆59Updated 4 months ago
VinAIResearch / XPhoneBERT
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)
☆343Updated last year
dangvansam / viet-asr
VietASR - Vietnamese Automatic Speech Recognition
☆157Updated last year
khanld / Wav2vec2-Pretraining
Wav2vec 2.0 Self-Supervised Pretraining
☆56Updated 9 months ago
wonjune-kang / llm-speech-summarization
Prompting Large Language Models with Audio for General-Purpose Speech Summarization
☆18Updated 6 months ago
vasistalodagala / whisper-finetune
Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.
☆352Updated 2 years ago
fengredrum / finetune-whisper-lora
Fine-Tune Whisper with Transformers and PEFT
☆57Updated 2 years ago
dangvansam / viet-tts
VietTTS: An Open-Source Vietnamese Text to Speech
☆74Updated 11 months ago