vb100 / whisper_ai_finetune
Fine-tune WhisperAI model to your language
☆21Updated last year
Alternatives and similar repositories for whisper_ai_finetune
Users that are interested in whisper_ai_finetune are comparing it to the libraries listed below
- [WIP] Scripts for fine-tuning Whisper☆220Updated 2 years ago
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆174Updated last year
- ☆199Updated 3 years ago
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.☆324Updated 2 years ago
- A non-native English corpus for pronunciation scoring task☆144Updated last year
- ONNX Inference of Pyannote Segmentation☆92Updated 7 months ago
- Fine-Tune Whisper with Transformers and PEFT☆57Updated last year
- Text to speech alignment using CTC forced alignment☆333Updated 4 months ago
- An enterprise-grade Voice Activity Detector from modelscope and funasr.☆107Updated 2 years ago
- Automatic Speech Recognition (ASR) model QuartzNet trained on English CommonVoice. In PyTorch with CTC loss and beam search.☆16Updated 4 years ago
- finetune llm part for spark-tts model☆103Updated 4 months ago
- Deep Learning - one shot learning for speaker recognition using Filter Banks☆170Updated last year
- ☆38Updated 3 years ago
- ☆162Updated 8 months ago
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus☆176Updated 8 months ago
- Speech Toolkit for Malaysian language, https://malaya-speech.readthedocs.io/☆264Updated this week
- Finetune VITS and MMS using HuggingFace's tools☆161Updated last year
- Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".☆179Updated 2 years ago
- ONNX wrapper for espnet inference model☆168Updated 10 months ago
- ☆82Updated last week
- Predicts the level of noise and reverberation on your audiofiles☆156Updated last month
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆64Updated 2 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆38Updated last year
- A curated list of awesome voice activity detection☆59Updated 8 months ago
- ☆49Updated 2 years ago
- A simple Python wrapper for audio noise reduction RNNoise. Simplifies work with it, adds new trained models and detailed instructions for…☆168Updated last year
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.☆44Updated 3 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆149Updated last year
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆253Updated last year
- Collection of pretrained models for the Montreal Forced Aligner☆158Updated last month