INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!
☆42Sep 11, 2023Updated 2 years ago
Alternatives and similar repositories for KAUST-Whisper-Adapter
Users that are interested in KAUST-Whisper-Adapter are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This Repository surveys the paper focusing on Prompting and Adapters for Speech Processing.☆111Aug 4, 2023Updated 2 years ago
- EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction☆270May 19, 2024Updated last year
- A curated list of awesome adversarial reprogramming and input prompting methods for neural networks since 2022☆38Nov 30, 2023Updated 2 years ago
- ☆17May 5, 2024Updated last year
- ☆88Jul 31, 2025Updated 8 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech und…☆42Mar 12, 2023Updated 3 years ago
- End-to-End Speech Processing Toolkit☆15Jan 20, 2025Updated last year
- Official implementation of MelHuBERT☆70Feb 21, 2026Updated last month
- Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning (ASRU2023)☆27Oct 10, 2023Updated 2 years ago
- ☆18Mar 13, 2024Updated 2 years ago
- [ACII 2023] PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Spe…☆59Jul 1, 2024Updated last year
- Transformer-based visually grounded speech models☆19Sep 22, 2022Updated 3 years ago
- A unified framework for Low-resource Audio Processing and Evaluation (SSL Pre-training and Downstream Fine-tuning)☆29Jul 9, 2024Updated last year
- Speech Security and Privacy Compendium - Mini☆10Jun 18, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).☆35Dec 17, 2024Updated last year
- Word Discovery in Visually Grounded, Self-Supervised Speech Models☆27Dec 4, 2023Updated 2 years ago
- ☆11Oct 20, 2022Updated 3 years ago
- ☆19Apr 28, 2023Updated 2 years ago
- Cross-lingual Fact-to-Text Alignment and Generation for Low-Resource Languages☆11Jan 1, 2023Updated 3 years ago
- ☆33Dec 23, 2025Updated 3 months ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆53Dec 6, 2022Updated 3 years ago
- ☆12Mar 23, 2026Updated 3 weeks ago
- Code for Deep Multimodal Clustering for Unsupervised Audiovisual Learning (CVPR2019)☆15May 27, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Understanding and Tackling Hallucinations in Large Audio-Language Models | ICASSP 2025, Interspeech 2024☆34Mar 14, 2025Updated last year
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"☆14Feb 24, 2025Updated last year
- DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning☆54Jan 18, 2024Updated 2 years ago
- NVIDIA Riva SDK Demonstration for Feb 2022,2023 Developer Meetup☆10Jan 11, 2023Updated 3 years ago
- ☆37Jun 30, 2022Updated 3 years ago
- [NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee…☆17Sep 19, 2023Updated 2 years ago
- Code for paper "Large Language Models are Efficient Learners of Noise-Robust Speech Recognition"☆140May 8, 2024Updated last year
- 分享在深蓝学院《语音识别:从入门到精通》第一期课程学习过程中完成的课后作业,供参考。☆22Sep 13, 2020Updated 5 years ago
- Fine-Tune Whisper with Transformers and PEFT☆58Nov 4, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.☆48Sep 4, 2023Updated 2 years ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆24Feb 25, 2025Updated last year
- INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. …☆686Dec 25, 2024Updated last year
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆49Dec 25, 2024Updated last year
- [ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.☆45Mar 15, 2024Updated 2 years ago
- ☆17Jul 22, 2024Updated last year
- Code for paper "Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition"☆20May 24, 2023Updated 2 years ago