Code for the paper "FastAdaSP: An Efficient Multitask Inference Framework for Large Speech Language Models". @ EMNLP'24(Oral)
☆12Nov 14, 2024Updated last year
Alternatives and similar repositories for FastAdaSP
Users that are interested in FastAdaSP are comparing it to the libraries listed below
Sorting:
- ☆22Jun 24, 2024Updated last year
- ☆15Jul 4, 2024Updated last year
- ☆17Jul 22, 2024Updated last year
- Target speaker automatic speech recognition (TS-ASR)☆12Oct 14, 2023Updated 2 years ago
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆11Nov 3, 2020Updated 5 years ago
- Getting confidences from any end-to-end systems☆11May 24, 2023Updated 2 years ago
- kaldi cnn-tdnnf baseline☆13Aug 31, 2021Updated 4 years ago
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated 11 months ago
- Implementation of algorithms for refinement of direction of arrival estimators by optimization☆16Jun 2, 2021Updated 4 years ago
- ☆17May 5, 2024Updated last year
- ☆15Mar 25, 2024Updated last year
- This is a public repository for RATS Channel-A Speech Data, which is a chargeable noisy speech dataset under LDC. Here we release its Log…☆16Oct 22, 2022Updated 3 years ago
- Automatic Speech Recognition (ASR) system for the Samrómur speech corpus using Kaldi☆12Sep 30, 2022Updated 3 years ago
- Light-weight transfer learning framework for on-device speech and audio recognition using pre-trained image convolutional neural networks…☆18Apr 16, 2022Updated 3 years ago
- ☆16Jun 13, 2022Updated 3 years ago
- ☆22Jun 30, 2021Updated 4 years ago
- MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.☆22Apr 8, 2021Updated 4 years ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆24Feb 25, 2025Updated last year
- MSP-Podcast Challenge Baseline Code for Interspeech 2025☆28Dec 4, 2024Updated last year
- Gemma-based Multilingual Machine Translation Models☆36Feb 13, 2026Updated 2 weeks ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆34Aug 27, 2023Updated 2 years ago
- ☆32May 17, 2024Updated last year
- Keyword spotting by Kaldi library☆26Oct 26, 2016Updated 9 years ago
- Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"☆31Apr 29, 2022Updated 3 years ago
- Keyword Search Recipe for Subword ASR☆30Jul 12, 2019Updated 6 years ago
- Uyghur Single Speaker Speech Dataset. ウイグル語音声データセット☆33Apr 3, 2022Updated 3 years ago
- Implementation of Google's USM speech model in Pytorch☆35Feb 7, 2026Updated 3 weeks ago
- This repository contains the baseline system for CHiME-8 MMCSG challenge focusing on transcribing both sides of a conversation where one …☆40Mar 13, 2024Updated last year
- ☆37Jun 30, 2022Updated 3 years ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"☆11Apr 15, 2024Updated last year
- Enhanced Explainable Neural Network☆10Dec 25, 2021Updated 4 years ago
- An audio and transcribed corpus of contemporary Hong Kong Cantonese☆40Dec 30, 2020Updated 5 years ago
- Non-parallel voice conversion called ICRCycleGAN-VC based on CycleGAN and Inception-resNet module by Afiuny☆15Oct 30, 2025Updated 4 months ago
- Code for the paper "SMACE: A New Method for the Interpretability of Composite Decision Systems", ECML 2022☆15Apr 17, 2023Updated 2 years ago
- Detecting and correction dysfluencies/stuttering/stammering in audio files☆10Apr 23, 2023Updated 2 years ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Oct 26, 2020Updated 5 years ago
- WavReward: Spoken Dialogue Models With Generalist Reward Evaluators☆54May 15, 2025Updated 9 months ago
- The project is associated with the recently-launched INTERSPEECH 2025 Workshop on Multilingual Conversational Speech Language Model (MLC-…☆50May 14, 2025Updated 9 months ago
- Whisper finetuning☆16Apr 9, 2025Updated 10 months ago