fengredrum / finetune-whisper-loraView external linksLinks
Fine-Tune Whisper with Transformers and PEFT
☆58Nov 4, 2023Updated 2 years ago
Alternatives and similar repositories for finetune-whisper-lora
Users that are interested in finetune-whisper-lora are comparing it to the libraries listed below
Sorting:
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆21May 26, 2025Updated 8 months ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆11Mar 14, 2025Updated 11 months ago
- ☆32Dec 4, 2022Updated 3 years ago
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)☆12Mar 12, 2024Updated last year
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated 10 months ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Aug 1, 2025Updated 6 months ago
- ☆30Jun 12, 2025Updated 8 months ago
- We Speech Transcript based on LLM, in 300 lines of code.☆183Jun 20, 2025Updated 7 months ago
- A casual and simple ChatGPT Python script that can run using terminal (as long as you have an API). Support Azure API.☆21May 3, 2025Updated 9 months ago
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024☆12Apr 15, 2025Updated 10 months ago
- Diffusion Model for Voice Conversion☆69Mar 14, 2024Updated last year
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆37Oct 6, 2023Updated 2 years ago
- The implementation for "Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions"☆50Apr 7, 2025Updated 10 months ago
- ONNX deployment of the CREPE pitch tracker☆26Oct 27, 2022Updated 3 years ago
- Latex template for CUHK PhD Thesis☆11Jun 29, 2025Updated 7 months ago
- INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!☆43Sep 11, 2023Updated 2 years ago
- A real-time visual analysis software for depth tracking in noisy oil and gas environments.☆12Jan 13, 2026Updated last month
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆10Sep 30, 2024Updated last year
- silero-vad pytorch implement☆34Nov 23, 2024Updated last year
- ☆54Jul 1, 2024Updated last year
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- Code for the paper "Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots" (NAACL-HLT 2021)☆10May 1, 2025Updated 9 months ago
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".☆30Aug 2, 2025Updated 6 months ago
- ☆558Jul 10, 2024Updated last year
- ☆11Oct 14, 2023Updated 2 years ago
- ☆12Jul 23, 2024Updated last year
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated 10 months ago
- Manage audio and video datasets☆33Feb 5, 2026Updated last week
- Weighted Cross-entropy for Low-Resource Languages in Multilingual Speech Recognition☆15Sep 3, 2024Updated last year
- This repository follows papers and reports on discrete speech representation learning and speech tokenization methods for speech language…☆15Dec 1, 2023Updated 2 years ago
- ☆16Nov 9, 2023Updated 2 years ago
- Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation☆18May 17, 2023Updated 2 years ago
- ☆14Apr 2, 2023Updated 2 years ago
- ☆14Aug 19, 2024Updated last year
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR☆73Jun 8, 2025Updated 8 months ago
- The second generation of VoiceFixer, a toolkit for general speech restoration. *Not affiliated with the original VoiceFixer repo*☆21Nov 19, 2023Updated 2 years ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆19Dec 1, 2022Updated 3 years ago