vb100 / whisper_ai_finetune

Fine-tune WhisperAI model to your language

☆21

Alternatives and similar repositories for whisper_ai_finetune

Users who are interested in whisper_ai_finetune are comparing it to the libraries listed below.

ylacombe / finetune-hf-vits
Finetune VITS and MMS using HuggingFace's tools
☆158 Updated last year
tuanh123789 / Spark-TTS-finetune
Fine-tune the LLM part of the Spark-TTS model
☆96 Updated 3 months ago
roatienza / efficientspeech
PyTorch code implementation of EfficientSpeech - to be presented at ICASSP 2023.
☆172 Updated last year
lovemefan / fsmn-vad
An enterprise-grade Voice Activity Detector from ModelScope and FunASR.
☆103 Updated 2 years ago
k2-fsa / colab
Colab notebooks for Next-gen Kaldi
☆28 Updated 3 months ago
YuanGongND / gopt
Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".
☆178 Updated 2 years ago
pengzhendong / pyannote-onnx
ONNX Inference of Pyannote Segmentation
☆92 Updated 6 months ago
jonatasgrosman / wav2vec2-sprint
☆195 Updated 3 years ago
vineeths96 / Spoken-Keyword-Spotting
In this repository, we explore using a hybrid system consisting of a Convolutional Neural Network and a Support Vector Machine for Keywor…
☆102 Updated 2 years ago
csukuangfj / kaldifeat
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - P…
☆202 Updated 2 weeks ago
pengzhendong / pysilero
Python Wrapper of Silero VAD
☆56 Updated 2 months ago
DakeQQ / Audio-Denoiser-ONNX
Utilizes ONNX Runtime for audio denoising.
☆57 Updated last week
harvard-edge / multilingual_kws
Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus
☆176 Updated 7 months ago
lovemefan / paraformer-python
Paraformer (Chinese ASR) online ONNX runtime for Python
☆48 Updated last year
marianne-m / brouhaha-vad
Predicts the level of noise and reverberation in your audio files
☆152 Updated 3 weeks ago
backspacetg / simul_whisper
Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection
☆68 Updated 3 months ago
freds0 / data_augmentation_for_asr
A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.
☆43 Updated 3 years ago
fengredrum / finetune-whisper-lora
Fine-Tune Whisper with Transformers and PEFT
☆57 Updated last year
csukuangfj / kaldi-native-fbank
Kaldi-compatible online fbank extractor without external dependencies
☆111 Updated this week
ccoreilly / wav2vec2-service
☆38 Updated 3 years ago
bigcash / awesome-vad
A curated list of awesome voice activity detection
☆59 Updated 7 months ago
skit-ai / SpeechLLM
This repository contains the training, inference, evaluation code for SpeechLLM models and details about the model releases on huggingfac…
☆115 Updated last year
bayartsogt-ya / whisper-multiple-hf-datasets
Whisper fine-tuning event script to use multiple hf datasets
☆32 Updated 2 years ago
HHousen / speaker-change-detection
Speaker change detection using SincNet and an LSTM/Transformer
☆52 Updated last month
RevoSpeechTech / speech-datasets-collection
A curated list of speech datasets (110+ datasets, 75+ easy to download)
☆139 Updated 2 years ago
k2-fsa / text_search
Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup
☆71 Updated 2 weeks ago
vasistalodagala / whisper-finetune
Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.
☆317 Updated 2 years ago
dobby-seo / Wav2Keyword
Wav2Keyword is a keyword spotting (KWS) model based on Wav2Vec 2.0. It achieves state-of-the-art results on the Speech Commands datasets V1 and V2.
☆108 Updated 2 years ago
lovemefan / CT-Transformer-punctuation
An enterprise-grade Chinese-English code-switching punctuator from FunASR.
☆24 Updated last year
ai-zahran / E2E-R
Code for Fine-tuning Self-Supervised Learning Models for End-to-End Pronunciation Scoring
☆28 Updated last year