vb100 / whisper_ai_finetune
Fine-tune WhisperAI model to your language
☆21Updated last year
Alternatives and similar repositories for whisper_ai_finetune
Users that are interested in whisper_ai_finetune are comparing it to the libraries listed below
Sorting:
- Finetune VITS and MMS using HuggingFace's tools☆151Updated last year
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.☆77Updated 6 months ago
- Goodness of Pronunciation (GOP) for oral reading assessment.☆51Updated 3 years ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection☆62Updated last month
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆100Updated 2 years ago
- Python Wrapper of Silero VAD☆53Updated last week
- finetune llm part for spark-tts model☆69Updated last month
- Fine-Tune Whisper with Transformers and PEFT☆55Updated last year
- This repository contains the training, inference, evaluation code for SpeechLLM models and details about the model releases on huggingfac…☆104Updated 10 months ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆73Updated 9 months ago
- ☆138Updated 5 months ago
- Predicts the level of noise and reverberation on your audiofiles☆149Updated 11 months ago
- ☆26Updated 3 months ago
- Pytorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problem☆93Updated last year
- Application of MB-iSTFT-VITS components to vits2_pytorch☆127Updated 5 months ago
- ☆37Updated last year
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations☆158Updated last year
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆59Updated 4 years ago
- Colab notebooks for Next-gen Kaldi☆27Updated last month
- A enterprise-grade Chinese-English code switch punctuator from funasr.☆23Updated last year
- one script for xls-r/xlsr/whisper fine-tuning☆41Updated last year
- In this repository, we explore using a hybrid system consisting of a Convolutional Neural Network and a Support Vector Machine for Keywor…☆101Updated 2 years ago
- Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",…☆79Updated last year
- ☆38Updated 3 years ago
- ☆39Updated 9 months ago
- a curated list of speech datasets (110+ datasets, 75+ easy to download)☆131Updated 2 years ago
- Chinese and English Bilinguish G2P☆21Updated last year
- Pre-trained grapheme-to-phoneme (G2P) models☆25Updated 3 years ago
- Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".☆172Updated 2 years ago
- paraformer(chinense asr) online onnx runtime for python☆44Updated last year