Jiaxin-Ye/Emo-DNA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Jiaxin-Ye/Emo-DNA)

Jiaxin-Ye / Emo-DNA

[ACM MM 2023] Official PyTorch implementation of "Emo-DNA: Emotion Decoupling and Alignment Learning for Cross-Corpus Speech Emotion Recognition".

☆12

Alternatives and similar repositories for Emo-DNA

Users that are interested in Emo-DNA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Jiaxin-Ye / TIM-Net_SER
View on GitHub
[ICASSP 2023] Official Tensorflow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech E…
☆191May 15, 2024Updated 2 years ago
zxzhao0 / C2SER
View on GitHub
We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…
☆49Mar 3, 2025Updated last year
Sreyan88 / RECAP
View on GitHub
Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning
☆16Jun 23, 2024Updated 2 years ago
SCNU-RISLAB / CNN-Transformer-and-Multidimensional-Attention-Mechanism
View on GitHub
☆34Jul 17, 2025Updated last year
MengboLi / MS-SENet
View on GitHub
☆11Jul 16, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ECNU-Cross-Innovation-Lab / ENT
View on GitHub
[ICASSP 2024] Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition
☆28Apr 11, 2024Updated 2 years ago
shengcanxu / canoSpeech
View on GitHub
text to speech
☆10Mar 19, 2024Updated 2 years ago
ECNU-Cross-Innovation-Lab / ShiftSER
View on GitHub
[ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations
☆39Dec 18, 2023Updated 2 years ago
choiHkk / nix-tts
View on GitHub
End-To-End SpeechSynthesis system with knowledge distillation
☆18Jul 16, 2022Updated 4 years ago
HappyColor / Vesper
View on GitHub
A Compact and Effective Pretrained Model for Speech Emotion Recognition
☆55Apr 10, 2026Updated 3 months ago
bshall / dusted
View on GitHub
DUSTED: Spoken-Term Discovery using Discrete Speech Units
☆17Oct 2, 2024Updated last year
enesdoruk / DA-Mamba
View on GitHub
[ECAI] DA-Mamba: Domain Adaptive Hybrid Mamba-Transformer Based One-Stage Object Detection
☆17May 6, 2025Updated last year
xinshengwang / robpitch
View on GitHub
A pitch detection model trained to be robust against noise and reverberation environments.
☆27Jan 21, 2025Updated last year
tsun / APA
View on GitHub
Domain Adaptation with Adversarial Training on Penultimate Activations (AAAI 2023)
☆11Aug 1, 2023Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
lwang114 / GraphUnsupASR
View on GitHub
☆10Apr 17, 2024Updated 2 years ago
adelacvg / DPTTS
View on GitHub
An AR+AR TTS attempt.
☆18Jan 13, 2025Updated last year
keonlee9420 / Stepwise_Monotonic_Multihead_Attention
View on GitHub
PyTorch Implementation of Stepwise Monotonic Multihead Attention similar to Enhancing Monotonicity for Robust Autoregressive Transformer …
☆39May 16, 2021Updated 5 years ago
ZehuaKcrissLi / GTR-Voice
View on GitHub
☆16Nov 11, 2024Updated last year
scutcsq / Neural-Transducers-for-Two-Stage-Text-to-Speech-via-Semantic-Token-Prediction
View on GitHub
Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…
☆60Apr 4, 2024Updated 2 years ago
Peihao-Xiang / MultiMAE-DER
View on GitHub
[IEEE ICPRS 2024 Oral] TensorFlow code implementation of "MultiMAE-DER: Multimodal Masked Autoencoder for Dynamic Emotion Recognition"
☆19Mar 13, 2026Updated 4 months ago
SLPcourse / Singing-Voice-Conversion
View on GitHub
Project of Singing Voice Conversion.
☆16Oct 27, 2023Updated 2 years ago
walker-hyf / ECSS
View on GitHub
Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling (Accepted by AAAI'2024)
☆59Jun 20, 2024Updated 2 years ago
Tikai7 / DiTTO-TTS
View on GitHub
DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors
☆39Feb 11, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
light1726 / SpeechTripleNet
View on GitHub
The implementation of paper "SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody"
☆33Nov 23, 2023Updated 2 years ago
yao-papercodes / AGLRLS
View on GitHub
Adaptive Global-Local Representation Learning and Selection for Cross-Domain Facial Expression Recognition (TMM 2024)
☆17Aug 13, 2024Updated last year
xiaomi1024 / code_SAMS
View on GitHub
☆13Jan 11, 2024Updated 2 years ago
AI-S2-Lab / FluentEditor
View on GitHub
[InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency
☆62Oct 23, 2024Updated last year
advanc3dUA / WohnungSuchen
View on GitHub
🏠🔍 Auto check for new apartments in Hamburg from various real estate provides
☆16Apr 15, 2026Updated 3 months ago
Choddeok / EmoSphere-TTS
View on GitHub
[INTERSPEECH 2024] The official implementation of EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for …
☆182Jul 16, 2026Updated last week
lixiangucas01 / GLAM
View on GitHub
This is the official code for paper "Speech Emotion Recognition with Global-Aware Fusion on Multi-scale Feature Representation" published…
☆49Apr 11, 2022Updated 4 years ago
koudounasalkis / Audio-Speech-Tutorial
View on GitHub
This repository contains a short introduction on the topic of audio and speech processing -- from basics to applications.
☆19Dec 20, 2023Updated 2 years ago
dingchaoyue / AcFormer
View on GitHub
☆29Aug 2, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
NikolaiKyhne / RWSAMamba-UNet
View on GitHub
Official repository for the paper "Exploring Resolution-Wise Shared Attention in Hybrid Mamba-U-Nets for Improved Cross-Corpus Speech Enh…
☆19May 5, 2026Updated 2 months ago
asuni / PitchSqueezer
View on GitHub
A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation
☆38Jan 17, 2024Updated 2 years ago
p0p4k / Matcha-TTS-2
View on GitHub
E2E TTS using Conditional Flow Matching (Experimental*)
☆71Nov 10, 2023Updated 2 years ago
chenqi008 / V2C
View on GitHub
Pytorch implementation for “V2C: Visual Voice Cloning”
☆35Jan 28, 2023Updated 3 years ago
amitaybar / Domain-Adaptation-for-DoA-Estimation-in-Multipath-Channels-with-Interferences
View on GitHub
This is the code for the paper "Domain Adaptation for DoA Estimation in Multipath Channels with Interferences", A. Bar, J. S. Picard, I. …
☆15Oct 30, 2023Updated 2 years ago
scutcsq / DWFormer
View on GitHub
DWFormer: Dynamic Window Transformer for Speech Emotion Recognition(ICASSP 2023 Oral)
☆69Jul 8, 2024Updated 2 years ago
lucasjinreal / textfrontend
View on GitHub
单独维护的中文TTS
☆34Oct 28, 2022Updated 3 years ago