Srijith-rkr / KAUST-Whisper-AdapterView external linksLinks
INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!
☆43Sep 11, 2023Updated 2 years ago
Alternatives and similar repositories for KAUST-Whisper-Adapter
Users that are interested in KAUST-Whisper-Adapter are comparing it to the libraries listed below
Sorting:
- This Repository surveys the paper focusing on Prompting and Adapters for Speech Processing.☆112Aug 4, 2023Updated 2 years ago
- EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction☆267May 19, 2024Updated last year
- ☆86Jul 31, 2025Updated 6 months ago
- A curated list of awesome adversarial reprogramming and input prompting methods for neural networks since 2022☆38Nov 30, 2023Updated 2 years ago
- Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech und…☆43Mar 12, 2023Updated 2 years ago
- Official implementation of MelHuBERT☆68Oct 26, 2024Updated last year
- ☆17May 5, 2024Updated last year
- ☆18Mar 13, 2024Updated last year
- Transformer-based visually grounded speech models☆19Sep 22, 2022Updated 3 years ago
- ☆19Apr 28, 2023Updated 2 years ago
- End-to-End Speech Processing Toolkit☆15Jan 20, 2025Updated last year
- [ACII 2023] PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Spe…☆60Jul 1, 2024Updated last year
- ☆10Oct 20, 2022Updated 3 years ago
- Word Discovery in Visually Grounded, Self-Supervised Speech Models☆26Dec 4, 2023Updated 2 years ago
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.☆48Sep 4, 2023Updated 2 years ago
- A Rust-based, SenseVoiceSmall☆23Jan 12, 2026Updated last month
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"☆14Feb 24, 2025Updated 11 months ago
- Speech Security and Privacy Compendium - Mini☆10Jun 18, 2024Updated last year
- ☆10Oct 16, 2025Updated 4 months ago
- ☆32Dec 23, 2025Updated last month
- This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).☆36Dec 17, 2024Updated last year
- DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning☆53Jan 18, 2024Updated 2 years ago
- Understanding and Tackling Hallucinations in Large Audio-Language Models | ICASSP 2025, Interspeech 2024☆32Mar 14, 2025Updated 11 months ago
- A unified framework for Low-resource Audio Processing and Evaluation (SSL Pre-training and Downstream Fine-tuning)☆30Jul 9, 2024Updated last year
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆49Dec 25, 2024Updated last year
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆53Dec 6, 2022Updated 3 years ago
- Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning (ASRU2023)☆27Oct 10, 2023Updated 2 years ago
- Experiments on AutoVC and WaveNet vocoder, compared against the Griffin Lim spectrogram inversion algorithm☆11Jun 18, 2020Updated 5 years ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆36Jan 17, 2024Updated 2 years ago
- [NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee…☆17Sep 19, 2023Updated 2 years ago
- Fine-Tune Whisper with Transformers and PEFT☆58Nov 4, 2023Updated 2 years ago
- ☆37Jun 30, 2022Updated 3 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆19Dec 1, 2022Updated 3 years ago
- speech recognition using Kaldi framework☆12Dec 25, 2019Updated 6 years ago
- Code for Deep Multimodal Clustering for Unsupervised Audiovisual Learning (CVPR2019)☆15May 27, 2020Updated 5 years ago
- ☆15Aug 25, 2022Updated 3 years ago
- A deepfake audio dataset for detecting fake speech from codec-based speech synthesis systems, Interspeech 2024☆20Jul 27, 2024Updated last year
- Single channel speech source separation by diffusion process (ICASSP 2023)☆124Mar 15, 2024Updated last year