wngh1187 / IPETLinks

Pytorch implementation of INTEGRATED PARAMETER-EFFICIENT TUNING FOR GENERAL-PURPOSE AUDIO MODELS

☆10

Alternatives and similar repositories for IPET

Users that are interested in IPET are comparing it to the libraries listed below

Sorting:

umbertocappellazzo / PETL_AST
This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi…
☆36Updated 11 months ago
declare-lab / speech-adapters
Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech und…
☆43Updated 2 years ago
magnumresearchgroup / Fastaudio
FastAudio is a Learnable Audio Frontend team Magnum's designed for the ASVspoof 2021 challenge
☆46Updated 2 years ago
kyegomez / AudioFlamingo
Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dial…
☆40Updated 5 months ago
felixgontier / dcase-2023-baseline
☆14Updated 2 years ago
sinhat98 / adapter-wavlm
☆43Updated 2 years ago
XinhaoMei / ACT
Source code for the paper 'Audio Captioning Transformer'
☆54Updated 3 years ago
WangHelin1997 / MaskSpec
The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training
☆42Updated 7 months ago
WangHelin1997 / SpecAugment-plus
A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification
☆34Updated 4 years ago
haoheliu / diffres-python
Learning differentiable temporal resolution on time-series data.
☆36Updated 2 years ago
mispchallenge / misp2022_baseline
☆30Updated 2 years ago
JinhuaLiang / lam4fsl
An official repo for the paper "Adapting Language-Audio Models as Few-Shot Audio Learners"
☆31Updated 2 years ago
sungnyun / ARMHuBERT
(Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT
☆40Updated 10 months ago
mct10 / CoBERT
Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning
☆47Updated last year
cuhealthybrains / MT-LLM
The implementation for "Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions"
☆42Updated 3 months ago
msplabresearch / MSP-Podcast_Challenge
MSP-Podcast Challenge Baseline Code
☆24Updated last year
theolepage / sslsv
Toolkit for training and evaluating Self-Supervised Learning (SSL) frameworks for Speaker Verification (SV).
☆26Updated last week
joannahong / AV-RelScore
Audio-Visual Corruption Modeling of our paper "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling an…
☆34Updated 2 years ago
Labbeti / aac-metrics
Metrics for evaluating Automated Audio Captioning systems, designed for PyTorch.
☆54Updated 2 weeks ago
LetianLee / Speech-Emotion-Recognition
An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning …
☆34Updated 3 years ago
GasserElbanna / serab-byols
(Hybrid) BYOL-S feature extractor using serab-byols package in pytorch.
☆27Updated last year
ga642381 / SpeechPrompt
**Interspeech 2022** 《SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks》Speec…
☆101Updated 3 months ago
Srijith-rkr / KAUST-Whisper-Adapter
INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!
☆36Updated last year
zaocan666 / DyViSE
Dynamic vision-guided speaker embedding for audio-visual speaker diarization
☆11Updated 3 years ago
sivannavis / samo
SAMO: SPEAKER ATTRACTOR MULTI-CENTER ONE-CLASS LEARNING FOR VOICE ANTI-SPOOFING
☆39Updated 2 years ago
andi611 / Mockingjay-Speech-Representation
Official Implementation of Mockingjay in Pytorch
☆54Updated 2 years ago
AlanBaade / MAE-AST-Public
Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer
☆88Updated 3 years ago
Yuto-Matsunaga / Prompt_Tuning_for_Audio_Deepfake_Detection
☆11Updated 8 months ago
usc-sail / peft-ser
PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Speech Models (…
☆60Updated last year
HarunoriKawano / BEST-RQ
Implementation of the paper "Self-supervised Learning with Random-projection Quantizer for Speech Recognition" in Pytorch.
☆81Updated 2 years ago