declare-lab / speech-adaptersLinks

Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech understanding

☆43

Alternatives and similar repositories for speech-adapters

Users that are interested in speech-adapters are comparing it to the libraries listed below

Sorting:

sinhat98 / adapter-wavlm
☆43Updated 2 years ago
ga642381 / Speech-Prompts-Adapters
This Repository surveys the paper focusing on Prompting and Adapters for Speech Processing.
☆111Updated 2 years ago
archiki / Robust-E2E-ASR
This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…
☆48Updated 7 months ago
ankitapasad / layerwise-analysis
Layer-wise analysis of self-supervised pre-trained speech representations
☆113Updated 9 months ago
ga642381 / SpeechPrompt
**Interspeech 2022** 《SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks》Speec…
☆102Updated 3 months ago
ga642381 / SpeechPrompt-v2
《SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks》Speech processing with prompting paradigm
☆81Updated last year
Srijith-rkr / KAUST-Whisper-Adapter
INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!
☆36Updated last year
HuangZiliAndy / SSL_for_multitalker
ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS
☆30Updated 2 years ago
tango4j / llm_speaker_tagging
SLT 2024 Challenge: Post-ASR-Speaker-Tagging
☆14Updated last year
EMOsuperb / EMO-SUPERB-submission
EMO-SUPERB submission
☆45Updated 11 months ago
mutiann / speech_rankings
A CSRankings-like index for speech researchers
☆34Updated 9 months ago
NaoyukiKanda / LibriSpeechMix
☆36Updated 4 years ago
tobefans / LSSED
☆51Updated 3 years ago
mechanicalsea / lighthubert
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
☆74Updated 2 years ago
sky1456723 / Pytorch-MBNet
A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK
☆60Updated 3 years ago
jindongwang / EasyEspnet
Making Espnet easier to use
☆56Updated 4 years ago
BUTSpeechFIT / EEND_dataprep
☆57Updated 4 months ago
mkunes / w2v2_audioFrameClassification
wav2vec2 audio classification for prosodic boundary detection and other tasks
☆41Updated last year
Hertin / WavPrompt
☆37Updated 3 years ago
HarunoriKawano / BEST-RQ
Implementation of the paper "Self-supervised Learning with Random-projection Quantizer for Speech Recognition" in Pytorch.
☆82Updated 2 years ago
mct10 / CoBERT
Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning
☆47Updated last year
YUCHEN005 / DPSL-ASR
Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"
☆42Updated 2 years ago
Alexander-H-Liu / dinosr
DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning
☆49Updated last year
WangHelin1997 / SpeechTasks
This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent spee…
☆76Updated last year
mispchallenge / misp2022_baseline
☆30Updated 2 years ago
Splend1d / T5lephone
Code for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5
☆19Updated 2 years ago
vectominist / spin
Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin…
☆56Updated 2 years ago
luferrer / ConfidenceIntervals
Confidence interval computation for evaluation in machine learning using the bootstrapping approach
☆87Updated last year
choijeongsoo / utut
[TASLP 2024] Textless Unit-to-Unit training for Many-to-Many Multilingual Speech-to-Speech Translation
☆31Updated 11 months ago
Lhx94As / PHO-LID
PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification
☆21Updated last year