leibniz-future-lab/SelfDistill-SER

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/leibniz-future-lab/SelfDistill-SER)

leibniz-future-lab / SelfDistill-SER

☆18

Alternatives and similar repositories for SelfDistill-SER

Users that are interested in SelfDistill-SER are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yl4579 / SLMGAN
View on GitHub
SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs
☆16Jul 19, 2023Updated 3 years ago
ECNU-Cross-Innovation-Lab / ShiftSER
View on GitHub
[ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations
☆39Dec 18, 2023Updated 2 years ago
lingjzhu / spoken_sent_embedding
View on GitHub
Unsupervised spoken sentence embeddings
☆14Dec 14, 2022Updated 3 years ago
JSALT-2022-SSL / superb-prosody
View on GitHub
☆31Jul 13, 2023Updated 3 years ago
zhaoren91 / awesome-heart-sound-analysis
View on GitHub
Awesome Heart Sound Analysis - A Survey
☆19Jul 13, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Vincent-ZHQ / CA-MSER
View on GitHub
Code for Speech Emotion Recognition with Co-Attention based Multi-level Acoustic Information
☆163Nov 27, 2023Updated 2 years ago
GATECH-EIC / S3-Router
View on GitHub
[NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee…
☆17Sep 19, 2023Updated 2 years ago
ga642381 / RobustVC
View on GitHub
**ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…
☆24Sep 27, 2022Updated 3 years ago
Janie1996 / MSRFG
View on GitHub
The code for Multi-Scale Receptive Field Graph Model for Emotion Recognition in Conversations
☆11Jan 17, 2023Updated 3 years ago
GeWanying / shap-anti-spoofing
View on GitHub
This repository includes the code to reproduce our paper [Explainable deepfake and spoofing detection: an attack analysis using SHapley A…
☆12Jan 24, 2024Updated 2 years ago
Allen0307 / AdapterBias
View on GitHub
Code for the Findings of NAACL 2022(Long Paper): AdapterBias: Parameter-efficient Token-dependent Representation Shift for Adapters in NL…
☆18May 4, 2022Updated 4 years ago
josebeo2016 / BTS-Encoder-ASVspoof
View on GitHub
Synthesis speech detection based on Breathing-Talking-Silence sounds
☆21Sep 3, 2025Updated 10 months ago
ankitapasad / layerwise-analysis
View on GitHub
Layer-wise analysis of self-supervised pre-trained speech representations
☆135Oct 18, 2024Updated last year
huashen218 / convxai
View on GitHub
CSCW 2023 Best Demo Award: Conversational AI Explanations to Support Human-AI Scientific Writing
☆15Jun 25, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
thuhcsi / DiffVar
View on GitHub
☆30Aug 12, 2023Updated 2 years ago
W-Wu / DEER
View on GitHub
☆12Aug 25, 2023Updated 2 years ago
jasonppy / FaST-VGS-Family
View on GitHub
Transformer-based visually grounded speech models
☆19Sep 22, 2022Updated 3 years ago
voidful / asrp
View on GitHub
ASR text preprocessing utility
☆21Aug 5, 2024Updated last year
B06901052 / DeepSpeed
View on GitHub
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
☆13Oct 11, 2022Updated 3 years ago
b04901014 / UUVC
View on GitHub
Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit…
☆83Jan 7, 2023Updated 3 years ago
Splend1d / T5lephone
View on GitHub
Code for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5
☆19Nov 29, 2022Updated 3 years ago
slp-rl / SLM-Discrete-Representations
View on GitHub
This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language M…
☆20Jan 3, 2023Updated 3 years ago
sinhat98 / adapter-wavlm
View on GitHub
☆46Feb 16, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
voidful / llm-codec
View on GitHub
LLM-Codec: Neural Audio Codec Meets Language Model Objectives
☆23May 3, 2026Updated 2 months ago
asuni / PitchSqueezer
View on GitHub
A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation
☆38Jan 17, 2024Updated 2 years ago
piotrkawa / deepfake-whisper-features
View on GitHub
Implementation of the paper "Improved DeepFake Detection Using Whisper Features"
☆117Apr 9, 2025Updated last year
yu-haoyuan / fd-badcat
View on GitHub
fd-sds
☆21Apr 8, 2026Updated 3 months ago
howard1337 / S2VC
View on GitHub
☆100Jul 22, 2021Updated 5 years ago
ddlBoJack / MT4SSL
View on GitHub
[INTERSPEECH 2023 Best Paper Shortlist] Official implementation for MT4SSL: Boosting Self-Supervised Speech Representation Learning by In…
☆45Mar 25, 2024Updated 2 years ago
YoungSeng / SRD-VC
View on GitHub
Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)
☆119Feb 7, 2024Updated 2 years ago
kamperh / vqwordseg
View on GitHub
Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.
☆39May 5, 2026Updated 2 months ago
cylin-cmlab / GCT-Prediction
View on GitHub
This is the official supplementary document for the GCT data and its prediction task.
☆10Feb 19, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ECNU-Cross-Innovation-Lab / ENT
View on GitHub
[ICASSP 2024] Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition
☆28Apr 11, 2024Updated 2 years ago
biggytruck / SpeechSplit2
View on GitHub
Official implementation of SpeechSplit2
☆135Oct 22, 2022Updated 3 years ago
youcaiSUN / MuSe-Wild_2020
View on GitHub
☆12Aug 24, 2020Updated 5 years ago
b04901014 / FT-w2v2-ser
View on GitHub
Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition
☆153Oct 26, 2021Updated 4 years ago
nervjack2 / MelHuBERT
View on GitHub
Official implementation of MelHuBERT
☆70Feb 21, 2026Updated 5 months ago
ga642381 / Taiwanese-Speech-Synthesis
View on GitHub
Taiwanese Speech Synthesis with Tacotron2
☆26Oct 2, 2022Updated 3 years ago
KunZhou9646 / controllable_evc_code
View on GitHub
This is the code for controllable EVC framework for seen and unseen emotion generation.
☆45Nov 3, 2021Updated 4 years ago