fengredrum / finetune-whisper-lora
Fine-Tune Whisper with Transformers and PEFT
☆51 · Updated last year
Alternatives and similar repositories for finetune-whisper-lora:
Users who are interested in finetune-whisper-lora compare it to the repositories listed below.
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning ☆85 · Updated 3 months ago
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings… ☆81 · Updated last month
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published at Interspeech 2023. ☆80 · Updated last year
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection ☆58 · Updated 2 weeks ago
- ConMamba for Automatic Speech Recognition ☆57 · Updated 6 months ago
- VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods. ☆36 · Updated 2 years ago
- wav2vec2 audio classification for prosodic boundary detection and other tasks ☆39 · Updated last year
- ASR papers, updated every day ☆143 · Updated this week
- Unofficial implementation of the WaveNeXt vocoder ☆42 · Updated 6 months ago
- Finetuning VITS Efficiently ☆32 · Updated last year
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO ☆61 · Updated 2 years ago
- A toolkit to calculate speech audio quality. Not affiliated with the original authors. ☆50 · Updated 6 months ago
- The official implementation of EmoSphere-TTS ☆107 · Updated last month
- Clustering-based methods for overlapping diarization ☆76 · Updated last year
- SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis ☆125 · Updated 2 months ago
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model ☆53 · Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer ☆47 · Updated 8 months ago
- This repository contains the training, inference, evaluation code for SpeechLLM models and details about the model releases on huggingfac… ☆84 · Updated 8 months ago
- Simple voice activity detection (VAD) algorithm in Python ☆12 · Updated last year
- The official PyTorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based … ☆115 · Updated 2 weeks ago
- INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters! ☆36 · Updated last year
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations ☆144 · Updated 11 months ago
- All generative models in one for a better TTS model ☆66 · Updated 5 months ago
- An unofficial PyTorch implementation of VALL-E ☆87 · Updated this week
- This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent spee… ☆74 · Updated 8 months ago
- The official implementation of EmoSphere++ ☆73 · Updated last month