bagustris/ssl-ser

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/bagustris/ssl-ser)

bagustris / ssl-ser

Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"

☆10

Alternatives and similar repositories for ssl-ser

Users that are interested in ssl-ser are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Chung-I / youtube-asr-crawler
View on GitHub
☆10Sep 19, 2022Updated 3 years ago
SatenikS / self-supervised-voice-emotion-recognition
View on GitHub
☆12Mar 25, 2021Updated 5 years ago
TehreemFarooqi / Preparing-a-speech-recognition-dataset-using-YouTube-videos
View on GitHub
Using YouTube to prepare a speech recognition dataset for any language
☆10Mar 30, 2021Updated 5 years ago
MTG / Podcastmix
View on GitHub
PodcastMix A dataset for separating music and speech in podcasts.
☆44Aug 20, 2024Updated last year
patrickvonplaten / Wav2Vec2_ParlanceCTCDecode
View on GitHub
☆11Nov 5, 2021Updated 4 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
jplhughes / emotion_detection_cpc
View on GitHub
Emotion detection in audio utilising self-supervised representations trained with Contrastive Predictive Coding (CPC).
☆43Feb 16, 2022Updated 4 years ago
jinny1208 / All-About-Speech
View on GitHub
☆14Apr 2, 2023Updated 3 years ago
KrishnaDN / BERTphone
View on GitHub
Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"
☆17Dec 10, 2020Updated 5 years ago
atharva-lipare / speech-to-senti-report
View on GitHub
Python script to generate a PDF report based on sentiment analysis, words usage, personality insights, tone analysis and facial expressio…
☆12Aug 1, 2021Updated 4 years ago
Speech-Lab-IITM / CCC-wav2vec-2.0
View on GitHub
Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…
☆23Mar 18, 2024Updated 2 years ago
talhanai / kaldi-diar-latte
View on GitHub
steps to perform text-based speaker diarization with kaldi toolkit
☆12Nov 2, 2018Updated 7 years ago
liyunfei0411 / labelimg-master
View on GitHub
☆14Apr 21, 2026Updated 3 months ago
YaoZhang93 / Semi-supervised-Cardiac-Image-Segmentation-via-Label-Propagation-and-Style-Transfer
View on GitHub
[MICCAI 2020 Challenge] This is the code for the 2nd-place method of MICCAI 2020 Multi-Centre, Multi-Vendor & Multi-Disease Cardiac Image…
☆11Nov 21, 2022Updated 3 years ago
yuhangear / wenet-android
View on GitHub
☆13Oct 27, 2021Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
poleval / 2021-punctuation-restoration
View on GitHub
PolEval 2021 Task 1
☆15Jun 28, 2022Updated 4 years ago
laurensw75 / docker-Kaldi-NL
View on GitHub
Docker for building an environment for Dutch online and offline ASR.
☆12Feb 2, 2021Updated 5 years ago
pkufool / simple-wer
View on GitHub
A simple command line tool to calculate WER for ASR.
☆14Oct 14, 2024Updated last year
bagustris / s3prl-ser
View on GitHub
S3PRL for Speech Emotion Recognition (see s3prl > downstream)
☆15Feb 28, 2026Updated 4 months ago
huaidanquede / Dense-TSNet
View on GitHub
offical code for Dense-TSNet
☆12Sep 17, 2024Updated last year
audiodemo / voice-conversion
View on GitHub
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17Aug 18, 2023Updated 2 years ago
yasumasaonoe / ecbd
View on GitHub
☆11Apr 23, 2023Updated 3 years ago
uiuctml / GOAT
View on GitHub
[JMLR] Gradual Domain Adaptation: Theory and Algorithms
☆11Jan 14, 2025Updated last year
yuhanghe01 / Sound3DVDet
View on GitHub
Code for WACV24 work for multiview acoustic-visual detection
☆13Mar 22, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
a791702141 / SSG
View on GitHub
This project is the official implementation of ``Self-Supervised Graph Neural Network for Multi-Source Domain Adaptation'' in PyTorch, wh…
☆12Nov 4, 2022Updated 3 years ago
ICASSP2021-tutorial9 / Distant_conversational_ASR_and_analysis
View on GitHub
☆12Jun 10, 2021Updated 5 years ago
liviaellen / engagementdetector
View on GitHub
Projects Student Engagement Detection System in E-Learning Environment using OpenCV and CNN
☆17Dec 6, 2023Updated 2 years ago
JazminVidal / gop-ft
View on GitHub
Transfer learning approach to pronunciation scoring
☆12Jan 17, 2024Updated 2 years ago
dan-wells / kiss-aligner
View on GitHub
Simple Kaldi recipe for forced alignment
☆11Jul 16, 2023Updated 3 years ago
1core2life / simple-twitch-chat-replay-downloader
View on GitHub
it makes txt file with chat written by twitch.tv replay.
☆13Apr 3, 2023Updated 3 years ago
Xiaobin-Rong / lite-rtse
View on GitHub
An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement
☆14Nov 19, 2023Updated 2 years ago
nainiayoub / emotion-classifier-web-app
View on GitHub
Logistic regression, text emotion classifier web application (with Streamlit), from data preprocession to model productionizing and deplo…
☆15Oct 7, 2025Updated 9 months ago
JabuMlDev / Speaker-VGG-CCT
View on GitHub
Official implementation of the paper "SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transfor…
☆25Feb 17, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
nervjack2 / Speech2Unit
View on GitHub
☆13Sep 25, 2024Updated last year
gbegus / DeepPhonologyTool
View on GitHub
Train a fiwGAN or ciwGAN model using your own training data
☆14Oct 13, 2022Updated 3 years ago
ductuantruong / speaker_age_estimation_ssl_study
View on GitHub
[APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models
☆14Oct 19, 2022Updated 3 years ago
SWivid / AUV
View on GitHub
An All-in-One Speech, Sound, Music Codec with Single Nested Codebook
☆28Oct 11, 2025Updated 9 months ago
VITA-Group / Audio-Lottery
View on GitHub
[ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…
☆32Apr 8, 2022Updated 4 years ago
noiseux1523 / NIST-SRE-2019
View on GitHub
Score Normalization for NIST 2019 Speaker Recognition Evaluation
☆10Nov 8, 2019Updated 6 years ago
aascode / Speech-Emotion-Recognition-2
View on GitHub
Speech emotion recognition using LSTM, SVM and MLP | 语音情感识别
☆10Jul 1, 2019Updated 7 years ago