vb100 / whisper_ai_finetune
Fine-tune the WhisperAI model for your language
☆21Updated 2 years ago
Alternatives and similar repositories for whisper_ai_finetune
Users who are interested in whisper_ai_finetune are comparing it to the libraries listed below.
- [WIP] Scripts for fine-tuning Whisper☆222Updated 2 years ago
 - An enterprise-grade Voice Activity Detector from modelscope and funasr.☆114Updated 2 years ago
 - PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆177Updated last year
 - Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection☆94Updated 7 months ago
 - ONNX Inference of Pyannote Segmentation☆95Updated 10 months ago
 - Finetune VITS and MMS using HuggingFace's tools☆170Updated last year
 - Colab notebooks for Next-gen Kaldi☆29Updated 3 weeks ago
 - Putting flows on top of neural transducers for better TTS☆64Updated 3 weeks ago
 - Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".☆187Updated 2 years ago
 - A simple Python wrapper for audio noise reduction RNNoise. Simplifies work with it, adds new trained models and detailed instructions for…☆175Updated last year
 - ONNX wrapper for espnet inference model☆169Updated 2 months ago
 - A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.☆46Updated 4 years ago
 - Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Updated 2 years ago
 - Barkify: an unofficial training implementation of Bark TTS by suno-ai☆127Updated 2 years ago
 - Predicts the level of noise and reverberation in your audio files☆167Updated 4 months ago
 - NeMo text processing for ASR and TTS☆382Updated this week
 - Kaldi-compatible online fbank extractor without external dependencies☆121Updated 3 weeks ago
 - Whisper fine-tuning event script to use multiple hf datasets☆32Updated 2 years ago
 - On-device voice activity detection (VAD) powered by deep learning☆232Updated last month
 - Building a deep learning model that predicts the gender of a speaker using TensorFlow 2☆127Updated 2 years ago
 - Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three differen…☆252Updated 3 years ago
 - An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆102Updated last year
 - SpeechDenoiser: Real-Time Speech Denoising with ONNX. Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆91Updated last year
 - In this repository, we explore using a hybrid system consisting of a Convolutional Neural Network and a Support Vector Machine for Keywor…☆103Updated 2 years ago
 - C++ version of pyannote audio speaker diarization pipeline☆21Updated last year
 - Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆38Updated 2 years ago
 - Utilizes ONNX Runtime for audio denoising.☆87Updated 3 weeks ago
 - Fine-Tune Whisper with Transformers and PEFT☆57Updated last year