skakouros/s3prl_attentive_correlation

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/skakouros/s3prl_attentive_correlation)

skakouros / s3prl_attentive_correlation

Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit

☆13

Alternatives and similar repositories for s3prl_attentive_correlation

Users that are interested in s3prl_attentive_correlation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ECNU-Cross-Innovation-Lab / ENT
View on GitHub
[ICASSP 2024] Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition
☆28Apr 11, 2024Updated 2 years ago
circle-hit / MuCDN
View on GitHub
Code for COLING 2022 accepted paper titled "MuCDN: Mutual Conversational Detachment Network for Emotion Recognition in Multi-Party Conver…
☆10Jul 21, 2023Updated 3 years ago
scutcsq / DWFormer
View on GitHub
DWFormer: Dynamic Window Transformer for Speech Emotion Recognition(ICASSP 2023 Oral)
☆69Jul 8, 2024Updated 2 years ago
bagustris / s3prl-ser
View on GitHub
S3PRL for Speech Emotion Recognition (see s3prl > downstream)
☆15Feb 28, 2026Updated 5 months ago
miccio-dk / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Apr 13, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
luomingshuang / k2-speechbrain
View on GitHub
In this repository, I try to combine k2 with speechbrain to decode well and fastly.
☆16Jun 17, 2022Updated 4 years ago
alefiury / SE-R-2022-SER-Track
View on GitHub
Code for the winning solution in the SE&R 2022 Challenge - SER track.
☆16Mar 28, 2023Updated 3 years ago
nii-yamagishilab / SSL-SAS
View on GitHub
Language independent SSL-based Speaker Anonymization system
☆20May 28, 2024Updated 2 years ago
qiujiali / lattice-rescore
View on GitHub
☆16Jun 13, 2022Updated 4 years ago
Kyoto-University-Speech-and-Audio / feng-asr-ser
View on GitHub
☆10Sep 6, 2020Updated 5 years ago
mzarvandi / SER-wav2vec
View on GitHub
Speech Emotion Recognition using transfer learning with wav2vec on IEMOCAP.
☆17Aug 8, 2021Updated 4 years ago
mechanicalsea / lighthubert
View on GitHub
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
☆73Sep 26, 2022Updated 3 years ago
vectominist / MiniASR
View on GitHub
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
☆53Dec 6, 2022Updated 3 years ago
NariFan2002 / AttA-NET
View on GitHub
ATTENTION AGGREGATION NETWORK FOR AUDIO-VISUAL EMOTION RECOGNITION
☆14Sep 25, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
HappyColor / DST
View on GitHub
Deformable Speech Transformer (DST)
☆35Aug 8, 2024Updated last year
voidful / wav2vec2-xlsr-multilingual-56
View on GitHub
56 language, 1 model Multilingual ASR
☆25Jul 25, 2021Updated 5 years ago
titu1994 / warprnnt_numba
View on GitHub
WarpRNNT loss ported in Numba CPU/CUDA for Pytorch
☆17Mar 11, 2022Updated 4 years ago
Vaibhavs10 / how-to-asr
View on GitHub
☆18Aug 29, 2022Updated 3 years ago
MrEdwards007 / WhisperTaskAcceleration
View on GitHub
Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization
☆25Oct 29, 2022Updated 3 years ago
ASolitaryMan / HFLEA
View on GitHub
FRAME-LEVEL EMOTIONAL STATE ALIGNMENT METHOD FOR SPEECH EMOTION RECOGNITION
☆23Dec 22, 2024Updated last year
edchengg / generative_model_speech
View on GitHub
Phone generation model/VAE/GAN/VAE+GAN
☆20Jun 26, 2018Updated 8 years ago
zzw922cn / wesinger2
View on GitHub
Synthesized singing voice demos of WeSinger 2 paper.
☆26Feb 20, 2023Updated 3 years ago
EnquanYang2022 / FRNet
View on GitHub
Feature_reconstruction_Network_for_RGB-D_Semantic_Segmentation
☆12Apr 28, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Jiaxin-Ye / TIM-Net_SER
View on GitHub
[ICASSP 2023] Official Tensorflow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech E…
☆191May 15, 2024Updated 2 years ago
usc-sail / peft-ser
View on GitHub
[ACII 2023] PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Spe…
☆60Jul 1, 2024Updated 2 years ago
hyperion-ml / hyperion
View on GitHub
Python toolkit for speech processing
☆72Updated this week
Janie1996 / AV4SER
View on GitHub
PyTorch implementation for Audio-Visual Domain Adaptation Feature Fusion for Speech Emotion Recognition
☆12Mar 20, 2022Updated 4 years ago
PINTO0309 / sne4onnx
View on GitHub
A very simple tool for situations where optimization with onnx-simplifier would exceed the Protocol Buffers upper file size limit of 2GB,…
☆17Feb 24, 2026Updated 5 months ago
RanHao-cq / FLAMNet
View on GitHub
PyTorch implementation of the paper "FLAMNet: A Flexible Line Anchor Mechanism Network for Lane Detection".
☆18Aug 5, 2023Updated 2 years ago
DeqingYang / CISPER
View on GitHub
Codes for paper "Contextual Information and Commonsense Based Prompt for Emotion Recognition in Conversation" published in ECML-PKDD 2022…
☆17Jul 6, 2022Updated 4 years ago
backspacetg / distilXLSR
View on GitHub
Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model
☆13Mar 30, 2025Updated last year
isjwdu / DFADD
View on GitHub
Official Implementation and Dataset of paper - DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset
☆16Apr 7, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
EIHW / MuSe2022
View on GitHub
☆28May 13, 2022Updated 4 years ago
HumeAI / competitions
View on GitHub
Hume AI ML Competitions
☆31Apr 7, 2026Updated 3 months ago
Sreyan88 / MMER
View on GitHub
Code for the InterSpeech 2023 paper: MMER: Multimodal Multi-task learning for Speech Emotion Recognition
☆83Mar 12, 2024Updated 2 years ago
zyh9929 / RL-EMO
View on GitHub
☆15Sep 2, 2023Updated 2 years ago
gchochla / Demux-MEmo
View on GitHub
[ICASSP'23] This repo contains code for the Demux & MEmo emotion recognition models (https://arxiv.org/abs/2210.15842), as well as code t…
☆23Jan 18, 2024Updated 2 years ago
lym0302 / paddlespeech_tts_cpp
View on GitHub
PaddleSpeech TTS cpp
☆42Mar 8, 2023Updated 3 years ago
nii-yamagishilab / speaker_sex_attribute_privacy
View on GitHub
Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE
☆15Nov 30, 2022Updated 3 years ago