wangtianrui / PM-EVCLinks

This is the official implement of A Controllable Emotion Voice Conversion Framework with Pre-trained Speech Representations

☆4

Alternatives and similar repositories for PM-EVC

Users that are interested in PM-EVC are comparing it to the libraries listed below

Sorting:

lijin0120 / CELSDS
A Chinese Expressive Long-dialogue Speech Dataset with Scripts
☆20Updated 9 months ago
wangtianrui / ProgRE
☆26Updated 10 months ago
chaufanglin / Normal2Whisper
Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"
☆11Updated 9 months ago
wyw97 / DENSE
ICASSP2025Dynamic Embedding Causal Target Speech Extraction
☆3Updated 4 months ago
Beilong-Tang / TSELM
Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models
☆47Updated 3 months ago
light1726 / SpeechTripleNet
The implementation of paper "SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody"
☆34Updated last year
hmartelb / avlit
Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…
☆19Updated last year
lucadellalib / discrete-wavlm-codec
A neural speech codec based on discrete WavLM representations
☆24Updated 11 months ago
kjw11 / Speaker-Aware-CTC
Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.
☆20Updated 2 months ago
XiangLi2022 / CM-TTS
[Findings of NAACL 2024] Source code of paper CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers a…
☆67Updated last year
walker-hyf / ECSS
Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling (Accepted by AAAI'2024)
☆56Updated last year
walker-hyf / FCTalker
FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)
☆25Updated last year
ddlBoJack / MT4SSL
[INTERSPEECH 2023 Best Paper Shortlist] Official implementation for MT4SSL: Boosting Self-Supervised Speech Representation Learning by In…
☆44Updated last year
walker-hyf / NCSSD
Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)
☆61Updated 9 months ago
scutcsq / Neural-Transducers-for-Two-Stage-Text-to-Speech-via-Semantic-Token-Prediction
Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…
☆61Updated last year
mispchallenge / misp2021_baseline
☆29Updated 3 years ago
HappyColor / DrawSpeech_PyTorch
☆19Updated 10 months ago
vivian556123 / NeurIPS2024-CoVoMix
Official repo for CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations
☆59Updated 6 months ago
xcmyz / ConvTasNet4BasisMelGAN
This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.
☆21Updated 4 years ago
pengzhendong / streaming-vocos
Streaming Vocos
☆29Updated 2 months ago
Audio-Foundation-Models / ConversationTTS
☆79Updated last month
AI-S2-Lab / FluentEditor
[InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency
☆55Updated 9 months ago
sungnyun / ARMHuBERT
(Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT
☆40Updated 11 months ago
zruiii / QwenAudioSFT
The repoduction codes for Qwen-Audio Fine-tuning
☆45Updated 11 months ago
RicherMans / SAT
Streaming Audiotransformers for online Audio tagging
☆46Updated last year
JusperLee / Look2hear
A toolkit for researchers in the multimodal sound separation.
☆16Updated last year
ZhikangNiu / A-DMA
[INTERSPEECH 2025]Official code for "Accelerating Diffusion-based Text-to-Speech Model Training with Dual Modality Alignment"
☆49Updated last month
mubtasimahasan / DM-Codec
Source code for DM-Codec.
☆46Updated 2 months ago
WangHelin1997 / DuTa-VC
Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…
☆37Updated last year
lavendery / AudioComposer
☆22Updated 10 months ago