☆19Mar 2, 2024Updated 2 years ago
Alternatives and similar repositories for Zero-shot-FaceVC
Users that are interested in Zero-shot-FaceVC are comparing it to the libraries listed below.
- ☆54Mar 2, 2023Updated 3 years ago
- ☆36Jun 16, 2023Updated 2 years ago
- ☆35Sep 24, 2024Updated last year
- Source code of APNet2, a vocoder☆58Nov 23, 2023Updated 2 years ago
- ☆59May 17, 2023Updated 2 years ago
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Oct 19, 2022Updated 3 years ago
- Collection of self-supervised models for speaker and language recognition tasks.☆19Jan 18, 2022Updated 4 years ago
- Voice Face Association Learning Paper List☆17May 20, 2023Updated 2 years ago
- ☆11Feb 14, 2025Updated last year
- The MOS system combines components from DNSMOS, NISQA, MOSSSL, and SIGMOS, using the librosa library to process audio waveforms.☆31Feb 16, 2024Updated 2 years ago
- ☆24Dec 11, 2025Updated 3 months ago
- Multi-task speech classification of the accent and gender of an English speaker on Mozilla's Common Voice dataset☆27May 30, 2025Updated 9 months ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆13Jul 22, 2024Updated last year
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆90Nov 13, 2020Updated 5 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated 10 months ago
- ☆11Jun 14, 2024Updated last year
- Source code and demo for INTERSPEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆37Dec 5, 2023Updated 2 years ago
- 🏆🏅 Repository for the GEB team's winning solutions in the IEEE Hybrid Energy Forecasting and Trading Competition (HEFTCom).☆28Oct 4, 2025Updated 5 months ago
- AvatarShield: Visual Reinforcement Learning for Human-Centric Video Forgery Detection☆22Jun 3, 2025Updated 9 months ago
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆14Feb 4, 2026Updated last month
- Three-level Hierarchical Transformer Networks for Long-sequence and Multiple Clinical Documents Classification☆11Apr 7, 2022Updated 3 years ago
- [CVPR2026] Skyra: AI-Generated Video Detection via Grounded Artifact Reasoning☆39Dec 22, 2025Updated 3 months ago
- ONNX-compatible StyleTTS2 code☆17Feb 28, 2026Updated 3 weeks ago
- Light-weight transfer learning framework for on-device speech and audio recognition using pre-trained image convolutional neural networks…☆18Apr 16, 2022Updated 3 years ago
- Qualifying exam preparation☆17May 7, 2025Updated 10 months ago
- Enhanced GPUstat-web☆10Oct 2, 2020Updated 5 years ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆179Apr 15, 2025Updated 11 months ago
- This code is a PyTorch version of the CycleFlow model in "CycleFlow: Purify Information Factors by Cycle Loss"☆15Jan 14, 2022Updated 4 years ago
- [ACL 2024] This is the Pytorch code for our paper "StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing"☆98Nov 14, 2024Updated last year
- StyleAR: Customizing Multimodal Autoregressive Model for Style-Aligned Text-to-Image Generation☆43Jun 6, 2025Updated 9 months ago
- VoiceLDM: Text-to-Speech with Environmental Context☆192Aug 9, 2024Updated last year
- Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement☆475May 19, 2025Updated 10 months ago
- An iMessage-style WeChat theme based on the "Themebox" plugin (continuously maintained, updated, and optimized; stars welcome 👏)☆16Dec 30, 2024Updated last year
- Code for calculating DNS_MOS.☆43Dec 18, 2022Updated 3 years ago
- Official code for Dense-TSNet☆12Sep 17, 2024Updated last year
- Multimodal Speech Recognition for phoneme level prediction using Audio-Visual data from TCDTIMIT dataset implementing RNNs with LSTMs for…☆15Jul 27, 2023Updated 2 years ago
- The PyTorch implementation of MOSNet☆16Dec 22, 2021Updated 4 years ago
- Exploring Binary Classification Loss for Speaker Verification☆18Jul 18, 2023Updated 2 years ago
- Used to evaluate a WaveNet vocoder by RMSE F0, MCD, RMSE AP...☆15Jan 20, 2020Updated 6 years ago