Voice Conversion Experiments for THUHCSI Course : <Digital Processing of Speech Signals>
☆18Nov 27, 2024Updated last year
Alternatives and similar repositories for dpss-exp3-VC-BNF
Users that are interested in dpss-exp3-VC-BNF are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation of our paper at ACL 2023: Pre-training Multi-party Dialogue Models with Latent Discourse Inference☆10Jul 10, 2023Updated 2 years ago
- The official codes for our paper at COLING 2022: Semantic-Preserving Adversarial Code Comprehension☆12Oct 23, 2022Updated 3 years ago
- ☆11May 7, 2022Updated 3 years ago
- text to speech☆10Mar 19, 2024Updated 2 years ago
- target speaker verification (tSV), ts-vector, universal speaker verification for single- and multi-talker speech☆15Jan 26, 2021Updated 5 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Neural Homomorphic Vocoder optimized for singing voice synthesis☆23Mar 20, 2026Updated last week
- ☆15Nov 11, 2024Updated last year
- BEGANSing - Korean SVS + SVC + AudioSR☆11Feb 17, 2024Updated 2 years ago
- Codes and data for EMNLP 2021 paper "Self- and Pseudo-self-supervised Prediction of Speaker and Key-utterance for Multi-party Dialogue Re…☆16Oct 15, 2022Updated 3 years ago
- ☆26Sep 22, 2022Updated 3 years ago
- Ke-Omni-R is an advanced audio reasoning model and achieved SOTA on MMAU☆60Jun 11, 2025Updated 9 months ago
- [INTERSPEECH 2025] The official implementation of DiEmo-TTS: Disentangled Emotion Representations via Self-Supervised Distillation for…☆16Sep 7, 2025Updated 6 months ago
- ☆14May 20, 2025Updated 10 months ago
- 单独维护的中文TTS☆34Oct 28, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- The official code of our paper at EMNLP 2022: Back to the Future: Bidirectional Information Decoupling Network for Multi-turn Dialogue Mo…☆16Feb 17, 2023Updated 3 years ago
- Adaptive Vocoder for Custom Voice☆61Sep 22, 2022Updated 3 years ago
- ☆24Apr 10, 2023Updated 2 years ago
- [AAAI 2024] GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval☆20May 10, 2024Updated last year
- 儿童故事常识推理与寓意理解评测(Commonsense Reasoning and Moral Understanding Evaluation in Children's Stories,CRMU)☆18Oct 22, 2024Updated last year
- ICASSP2022 TTS&VC Summary☆14Jun 9, 2022Updated 3 years ago
- The official implement of Freeze-Omni.☆15Jul 10, 2025Updated 8 months ago
- ☆11Dec 24, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- The code for the paper "Hybrid Contrastive Quantization for Efficient Cross-View Video Retrieval" (WWW'22, Oral).☆17Mar 8, 2022Updated 4 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Apr 25, 2023Updated 2 years ago
- ☆20Jun 5, 2022Updated 3 years ago
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆24Aug 1, 2025Updated 7 months ago
- The source code for the paper CrossSinger (asru2023)☆18Oct 12, 2023Updated 2 years ago
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆18Apr 17, 2024Updated last year
- 22人で童謡を5曲ずつ歌ってつくった歌唱データベースです。☆14Aug 7, 2022Updated 3 years ago
- singing voice conversion without f0☆23May 10, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ICCV'23 Dual Learning with Dynamic Knowledge Distillation for Partially Relevant Video Retrieval☆19Aug 22, 2025Updated 7 months ago
- [AAAI 2023] Contrastive Masked Autoencoders for Self-Supervised Video Hashing☆27Jul 4, 2023Updated 2 years ago
- Official github repo for ICCV2023 paper 'Multi-event Video-Text Retrieval'☆19Feb 16, 2024Updated 2 years ago
- An unofficial implementation of the Personal VAD speaker-conditioned voice activity detection method. Bachelor's thesis project.☆80Sep 22, 2022Updated 3 years ago
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆18May 17, 2024Updated last year
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆14Dec 30, 2023Updated 2 years ago
- ☆51Mar 5, 2026Updated 3 weeks ago