fengredrum / finetune-whisper-lora
Fine-Tune Whisper with Transformers and PEFT
☆38 Updated last year
Related projects
Alternatives and complementary repositories for finetune-whisper-lora
- ☆17 Updated 3 months ago
- This repository contains the training, inference, evaluation code for SpeechLLM models and details about the model releases on huggingfac… ☆61 Updated 4 months ago
- wav2vec2 audio classification for prosodic boundary detection and other tasks ☆36 Updated last year
- Finetuning VITS Efficiently ☆32 Updated last year
- ConMamba for Automatic Speech Recognition ☆45 Updated 3 months ago
- An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement ☆118 Updated 3 weeks ago
- ☆59 Updated 2 months ago
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations ☆126 Updated 8 months ago
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning ☆81 Updated this week
- This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent spee… ☆72 Updated 5 months ago
- High fidelity, lightweight, end-to-end, streaming, convolution-based neural audio codec ☆84 Updated last month
- Unofficial PyTorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake… ☆56 Updated last year
- FlashSpeech: Efficient Zero-Shot Speech Synthesis ☆97 Updated 2 months ago
- UTokyo-SaruLab MOS Prediction System ☆97 Updated 2 weeks ago
- Updates ASR papers every day ☆54 Updated this week
- Huawei Grad-TTS for Chinese ☆45 Updated last year
- INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters! ☆32 Updated last year
- The official implementation of EmoSphere-TTS ☆85 Updated 3 months ago
- All generative models in one for a better TTS model ☆66 Updated 2 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆84 Updated last month
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS ☆64 Updated last year
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training ☆119 Updated 2 years ago
- Unofficial implementation of the WaveNeXt vocoder ☆32 Updated 2 months ago
- ☆47 Updated 3 weeks ago
- A Survey of Spoken Dialogue Models (60 pages) ☆97 Updated this week
- Predicts the level of noise and reverberation in your audio files ☆138 Updated 6 months ago
- The official implementation of EmoSphere++ ☆41 Updated 2 weeks ago
- ☆66 Updated last year
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion ☆71 Updated 7 months ago
- Official Code for ParrotTTS ☆43 Updated last month