bagustris/s3prl-ser

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/bagustris/s3prl-ser)

bagustris / s3prl-ser

S3PRL for Speech Emotion Recognition (see s3prl > downstream)

☆15

Alternatives and similar repositories for s3prl-ser

Users that are interested in s3prl-ser are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

alefiury / SE-R-2022-SER-Track
View on GitHub
Code for the winning solution in the SE&R 2022 Challenge - SER track.
☆16Mar 28, 2023Updated 3 years ago
Hannes1 / react-native-wenet
View on GitHub
Wenet speech to text for react native
☆10Nov 1, 2022Updated 3 years ago
Neclow / SERAB
View on GitHub
SERAB: a multi-lingual benchmark for speech emotion recognition
☆28Dec 16, 2022Updated 3 years ago
ductuantruong / speaker_age_estimation_ssl_study
View on GitHub
[APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models
☆14Oct 19, 2022Updated 3 years ago
luomingshuang / k2-speechbrain
View on GitHub
In this repository, I try to combine k2 with speechbrain to decode well and fastly.
☆16Jun 17, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
dayanavivolab / s3prl
View on GitHub
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
☆10Feb 29, 2024Updated 2 years ago
skakouros / s3prl_attentive_correlation
View on GitHub
Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit
☆13Nov 18, 2022Updated 3 years ago
bagustris / SER_ICSigSys2019
View on GitHub
Repository of code for Speech emotion recognition using voiced speech and attention model, submitted to ICSigSys 2019
☆13Jan 6, 2020Updated 6 years ago
qiujiali / lattice-rescore
View on GitHub
☆16Jun 13, 2022Updated 4 years ago
babe269 / performant
View on GitHub
A toolset for easy formant extraction and visualization from wav files and TTS models
☆33Sep 2, 2022Updated 3 years ago
nii-yamagishilab / speaker_sex_attribute_privacy
View on GitHub
Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE
☆15Nov 30, 2022Updated 3 years ago
Labbeti / conette-audio-captioning
View on GitHub
CoNeTTE: An efficient Audio Captioning system leveraging multiple datasets with Task Embedding
☆23Dec 17, 2025Updated 7 months ago
indonesian-nlp / wav2vec2-indonesian
View on GitHub
☆20Apr 5, 2021Updated 5 years ago
bagustris / w2v2-vad
View on GitHub
A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition
☆22Aug 9, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
cyfer0618 / kaldi-pytorch-rnnlm
View on GitHub
Enable RNNLM lattice rescoring with Pytorch [kaldi]
☆12Jun 5, 2020Updated 6 years ago
Speech-Lab-IITM / CCC-wav2vec-2.0
View on GitHub
Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…
☆23Mar 18, 2024Updated 2 years ago
bagustris / coughkit
View on GitHub
A python toolkit to detect, segment, and count coughs
☆15Jul 1, 2026Updated 3 weeks ago
TehreemFarooqi / Preparing-a-speech-recognition-dataset-using-YouTube-videos
View on GitHub
Using YouTube to prepare a speech recognition dataset for any language
☆10Mar 30, 2021Updated 5 years ago
vectominist / MiniASR
View on GitHub
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
☆53Dec 6, 2022Updated 3 years ago
MrEdwards007 / WhisperTaskAcceleration
View on GitHub
Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization
☆25Oct 29, 2022Updated 3 years ago
nervjack2 / Speech2Unit
View on GitHub
☆13Sep 25, 2024Updated last year
audeering / w2v2-age-gender-how-to
View on GitHub
How to use our public wav2vec2 age and gender model
☆55Sep 4, 2023Updated 2 years ago
patrickvonplaten / Wav2Vec2_ParlanceCTCDecode
View on GitHub
☆11Nov 5, 2021Updated 4 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
miccio-dk / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Apr 13, 2022Updated 4 years ago
FantSun / Speechflow
View on GitHub
Speechflow for emotion recognition related information decomposition
☆10Jul 27, 2021Updated 4 years ago
utter-project / mHuBERT-147-scripts
View on GitHub
Collection of scripts from mHuBERT-147.
☆35Nov 19, 2024Updated last year
ffxiong / uaspeech
View on GitHub
Baseline kaldi script for UA-SPEECH corpus
☆32Oct 16, 2024Updated last year
hbwu-ntu / EmoCtrlTTS-Eval
View on GitHub
☆19Aug 23, 2024Updated last year
Chung-I / youtube-asr-crawler
View on GitHub
☆10Sep 19, 2022Updated 3 years ago
bagustris / ssl-ser
View on GitHub
Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"
☆10Mar 15, 2023Updated 3 years ago
ShovalMessica / NAST
View on GitHub
Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11…
☆46Jul 2, 2024Updated 2 years ago
talhanai / kaldi-diar-latte
View on GitHub
steps to perform text-based speaker diarization with kaldi toolkit
☆12Nov 2, 2018Updated 7 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
jasonppy / FaST-VGS-Family
View on GitHub
Transformer-based visually grounded speech models
☆19Sep 22, 2022Updated 3 years ago
jindongwang / EasyEspnet
View on GitHub
Making Espnet easier to use
☆54Apr 9, 2021Updated 5 years ago
shaokai1209 / MDSA
View on GitHub
[IEEE, TASLP, 2023] The code of the paper "Multi-Source Discriminant Subspace Alignment for Cross-Domain Speech Emotion Recognition".
☆19Sep 27, 2024Updated last year
ashi-ta / speechGLUE
View on GitHub
SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.
☆13Jun 2, 2023Updated 3 years ago
yuhangear / wenet-android
View on GitHub
☆13Oct 27, 2021Updated 4 years ago
minguinho26 / Prefix_AAC_ICASSP2023
View on GitHub
Official Implementation of "Prefix tuning for Automated Audio Captioning(ICASSP 2023)"
☆30Dec 6, 2023Updated 2 years ago
poleval / 2021-punctuation-restoration
View on GitHub
PolEval 2021 Task 1
☆15Jun 28, 2022Updated 4 years ago