N/A
☆187May 19, 2022Updated 3 years ago
Alternatives and similar repositories for BinauralSpeechSynthesis
Users that are interested in BinauralSpeechSynthesis are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Aug 13, 2023Updated 2 years ago
- Official implementation of SawSing (ISMIR'22)☆273Aug 28, 2022Updated 3 years ago
- A pytroch implementation of the EETS: End-to-End Adversarial Text-to-Speech☆127Jul 16, 2020Updated 5 years ago
- A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdf☆372Nov 5, 2021Updated 4 years ago
- Codebase for the paper "Visually Informed Binaural Audio Generation without Binaural Audios" (CVPR 2021)☆71Jul 8, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆44Dec 17, 2020Updated 5 years ago
- Opencpop: A High-Quality Open Source Chinese Popular Song Database for Singing Voice Synthesis☆232Dec 10, 2025Updated 3 months ago
- Pytorch implementation of "Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech"☆200Oct 31, 2023Updated 2 years ago
- A pytroch implementation of the FB-MelGAN☆90May 26, 2020Updated 5 years ago
- A repository for benchmarking neural vocoders by their quality and speed.☆211May 30, 2025Updated 10 months ago
- 2.5D visual sound☆118Jul 25, 2023Updated 2 years ago
- The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.☆144Jul 8, 2021Updated 4 years ago
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR☆1,043Jul 5, 2023Updated 2 years ago
- Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.☆157Jul 2, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆159Jul 16, 2022Updated 3 years ago
- Official implementation of the source-filter HiFiGAN vocoder☆270Jul 29, 2023Updated 2 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Sep 24, 2021Updated 4 years ago
- An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-S…☆416Aug 29, 2023Updated 2 years ago
- An Open-source Streaming High-fidelity Neural Audio Codec☆502Mar 4, 2025Updated last year
- ☆25Mar 12, 2022Updated 4 years ago
- An unofficial implement of autoregressive vocoder Multiband-WaveRNN. Audio samples in https://rongjiehuang.github.io/Multiband-WaveRNN/☆28Feb 12, 2021Updated 5 years ago
- Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"☆116Dec 22, 2021Updated 4 years ago
- LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation☆80Feb 24, 2021Updated 5 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Official implementation of Meta-StyleSpeech and StyleSpeech☆252Feb 9, 2022Updated 4 years ago
- PyTorch Implementation of Multi-Singer (ACM-MM'21)☆139May 8, 2022Updated 3 years ago
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!