billzyx / WavBERT
☆21Updated last year
Alternatives and similar repositories for WavBERT
Users who are interested in WavBERT are comparing it to the libraries listed below.
- ☆52Updated 4 years ago
- Official implementation of SpeechFormer written in Python (PyTorch).☆79Updated 2 years ago
- A Compact and Effective Pretrained Model for Speech Emotion Recognition☆52Updated last year
- Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech und…☆43Updated 2 years ago
- ☆17Updated 4 years ago
- [ACII 2023] PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Spe…☆60Updated last year
- ☆111Updated 3 years ago
- ☆45Updated 2 years ago
- Audio Captioning datasets for PyTorch.☆125Updated 5 months ago
- Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition☆153Updated 4 years ago
- [TASLP 2024] Textless Unit-to-Unit training for Many-to-Many Multilingual Speech-to-Speech Translation☆31Updated last year
- An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning …☆33Updated 3 years ago
- EMO-SUPERB submission☆50Updated 3 months ago
- SpeechFormer++ in PyTorch☆49Updated 2 years ago
- NAR-BERT-ASR☆10Updated 4 years ago
- Layer-wise analysis of self-supervised pre-trained speech representations☆122Updated last year
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆49Updated last year
- ☆176Updated last year
- [IJCAI 2024] EAT: Self-Supervised Pre-Training with Efficient Audio Transformer☆213Updated last month
- [ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations☆40Updated 2 years ago
- 《SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks》Speech processing with prompting paradigm☆82Updated 2 years ago
- Code for the InterSpeech 2023 paper: MMER: Multimodal Multi-task learning for Speech Emotion Recognition☆80Updated last year
- ☆42Updated 5 years ago
- Multi-task speech classification of the accent and gender of an English speaker on Mozilla's Common Voice dataset☆27Updated 7 months ago
- Implementation of the paper "Multimodal Transformer With Learnable Frontend and Self Attention for Emotion Recognition" submitted to ICAS…☆26Updated 4 years ago
- The code for our INTERSPEECH 2020 paper - Jointly Fine-Tuning "BERT-like" Self Supervised Models to Improve Multimodal Speech Emotion R…☆119Updated 4 years ago
- [INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation☆41Updated 2 years ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆157Updated 3 years ago
- This repository contains the code for our ICASSP paper `Speech Emotion Recognition using Semantic Information` https://arxiv.org/pdf/2103…☆27Updated 4 years ago
- Alzheimer's Dementia Recognition through Spontaneous Speech The ADReSSo Challenge☆12Updated 2 years ago