vb100 / whisper_ai_finetune
Fine-tune WhisperAI model to your language
☆20Updated last year
Alternatives and similar repositories for whisper_ai_finetune:
Users who are interested in whisper_ai_finetune are comparing it to the libraries listed below
- ☆38Updated 3 years ago
- Fine-Tune Whisper with Transformers and PEFT☆55Updated last year
- Tunable pipelines☆33Updated 2 months ago
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.☆302Updated last year
- C++ version of pyannote audio speaker diarization pipeline☆21Updated last year
- An enterprise-grade Voice Activity Detector from modelscope and funasr.☆96Updated 2 years ago
- ONNX Inference of Pyannote Segmentation☆85Updated 4 months ago
- ASR tutorial: https://dataxujing.github.io/ASR-paper/☆24Updated 9 months ago
- Repository contains code to fine-tune WhisperASR model☆23Updated 2 years ago
- one script for xls-r/xlsr/whisper fine-tuning☆41Updated last year
- paraformer (Chinese ASR) online onnx runtime for python☆43Updated last year
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆51Updated 4 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆50Updated 9 months ago
- Official Code for ParrotTTS☆48Updated 6 months ago
- Whisper fine-tuning event script to use multiple hf datasets☆32Updated 2 years ago
- ☆130Updated 4 months ago
- Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning (ASRU2023)☆27Updated last year
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆63Updated 2 years ago
- Final training script from the HuggingFace Whisper fine-tuning event, for getting the best results from a fine-tuned model.☆12Updated 2 years ago
- A framework for automatic speech recognition☆49Updated 2 years ago
- Huawei Grad-TTS for Chinese☆50Updated last year
- ☆27Updated 3 weeks ago
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.☆41Updated 3 years ago
- In this repository, we explore using a hybrid system consisting of a Convolutional Neural Network and a Support Vector Machine for Keywor…☆101Updated 2 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆39Updated last year
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations☆156Updated last year
- Port of Funasr's Paraformer model in C/C++☆32Updated 10 months ago
- Python Wrapper of Silero VAD☆51Updated this week
- ☆65Updated 7 months ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆96Updated last month