fengredrum / finetune-whisper-lora
Fine-Tune Whisper with Transformers and PEFT
☆37Updated last year
Related projects ⓘ
Alternatives and complementary repositories for finetune-whisper-lora
- ☆17Updated 3 months ago
- ☆20Updated 8 months ago
- This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent spee…☆72Updated 5 months ago
- Finetuning VITS Efficiently☆32Updated last year
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆64Updated last year
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection☆44Updated 2 months ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆119Updated 2 years ago
- Huawei Grad-TTS for Chinese☆45Updated last year
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning☆81Updated last week
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆83Updated last month
- PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions☆57Updated 3 weeks ago
- Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice☆128Updated 3 weeks ago
- An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement☆114Updated last week
- ConMamba for Automatic Speech Recognition☆44Updated 2 months ago
- Clustering-based methods for overlapping diarization☆68Updated 9 months ago
- Official Code for ParrotTTS☆41Updated 3 weeks ago
- ☆45Updated last week
- FlashSpeech: Efficient Zero-Shot Speech Synthesis☆93Updated last month
- ☆65Updated last year
- Official implementation of Vec-Tok Speech☆93Updated last year
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆43Updated 2 months ago
- 《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》☆74Updated last year
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆56Updated last year
- An automatic prosodic boundary annotation tool for Text-to-Speech Synthesis (TTS).☆48Updated 4 months ago
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations☆121Updated 8 months ago
- High fidelity, lightweight, end-to-end, streaming, convolution-based neural audio codec☆79Updated last month
- ☆27Updated 11 months ago
- ☆19Updated last year
- ☆10Updated 7 months ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆51Updated last year