vb100 / whisper_ai_finetuneLinks
Fine-tune WhisperAI model to your language
☆21Updated last year
Alternatives and similar repositories for whisper_ai_finetune
Users that are interested in whisper_ai_finetune are comparing it to the libraries listed below
Sorting:
- Finetune VITS and MMS using HuggingFace's tools☆158Updated last year
- finetune llm part for spark-tts model☆96Updated 3 months ago
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆172Updated last year
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆103Updated 2 years ago
- Colab notebooks for Next-gen Kaldi☆28Updated 3 months ago
- Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".☆178Updated 2 years ago
- ONNX Inference of Pyannote Segmentation☆92Updated 6 months ago
- ☆195Updated 3 years ago
- In this repository, we explore using a hybrid system consisting of a Convolutional Neural Network and a Support Vector Machine for Keywor…☆102Updated 2 years ago
- Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - P…☆202Updated 2 weeks ago
- Python Wrapper of Silero VAD☆56Updated 2 months ago
- Utilizes ONNX Runtime for audio denoising.☆57Updated last week
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus☆176Updated 7 months ago
- paraformer(chinense asr) online onnx runtime for python☆48Updated last year
- Predicts the level of noise and reverberation on your audiofiles☆152Updated 3 weeks ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection☆68Updated 3 months ago
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.☆43Updated 3 years ago
- Fine-Tune Whisper with Transformers and PEFT☆57Updated last year
- Kaldi-compatible online fbank extractor without external dependencies☆111Updated this week
- ☆38Updated 3 years ago
- A curated list of awesome voice activity detection☆59Updated 7 months ago
- This repository contains the training, inference, evaluation code for SpeechLLM models and details about the model releases on huggingfac…☆115Updated last year
- Whisper fine-tuning event script to use multiple hf datasets☆32Updated 2 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆52Updated last month
- a curated list of speech datasets (110+ datasets, 75+ easy to download)☆139Updated 2 years ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆71Updated 2 weeks ago
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.☆317Updated 2 years ago
- Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.☆108Updated 2 years ago
- A enterprise-grade Chinese-English code switch punctuator from funasr.☆24Updated last year
- Code for Fine-tuning Self-Supervised Learning Models for End-to-End Pronunciation Scoring☆28Updated last year