☆11Mar 22, 2023Updated 3 years ago
Alternatives and similar repositories for wav2vec-vc
Users that are interested in wav2vec-vc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Oct 20, 2022Updated 3 years ago
- Evaluation tool used in the BigVSAN paper☆14Mar 22, 2024Updated 2 years ago
- Streaming Vocos☆30Jun 10, 2025Updated 9 months ago
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA☆18Aug 16, 2024Updated last year
- ☆21Jul 15, 2024Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆82Jan 22, 2025Updated last year
- ☆12Nov 7, 2024Updated last year
- ☆32Nov 24, 2024Updated last year
- TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion☆148Jan 15, 2024Updated 2 years ago
- ☆14Apr 18, 2019Updated 6 years ago
- ☆55Jul 16, 2025Updated 8 months ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Jun 18, 2022Updated 3 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Oct 2, 2024Updated last year
- [T-IFS'24] Audio Multi-view Spoofing Detection Framework Based on Audio-Text-Emotion Correlations☆31Jul 31, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆21Sep 18, 2023Updated 2 years ago
- Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion☆155Oct 16, 2023Updated 2 years ago
- text to speech☆10Mar 19, 2024Updated 2 years ago
- silero-vad pytorch implement☆36Nov 23, 2024Updated last year
- This repository provides information on how to use the SINS database along with some example code. The SINS Dataset is composed of conti…☆23Dec 23, 2022Updated 3 years ago
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆77Jul 16, 2023Updated 2 years ago
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…☆27May 17, 2023Updated 2 years ago
- ☆24Feb 28, 2023Updated 3 years ago
- This repository provides an implementation of the DPCCN model for single-channel speech separation. More details will be updated soon.☆13Dec 8, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Lightweight Speech Representation Learning for One-Shot Voice Conversion☆24Dec 12, 2024Updated last year
- ☆23Feb 2, 2022Updated 4 years ago
- Official Pytorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Pr…☆235Jul 3, 2024Updated last year
- An 16kHz implementation of HiFi-GAN for soft-vc.☆106Jul 19, 2023Updated 2 years ago
- ☆19Jun 28, 2022Updated 3 years ago
- Voice conversion model for real-time speech synthesis using PPG (Phonetic PosteriorGram) as an intermediate feature, written in Pytorch.☆29Mar 3, 2022Updated 4 years ago
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated 11 months ago
- Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"☆14Feb 13, 2022Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- 基于PC-DDSP和nsf-HiFiGAN的声码器☆18Jul 17, 2023Updated 2 years ago
- ☆37Nov 18, 2025Updated 4 months ago
- Vocoder NSF-HiFiGAN (Moved into deepaudio)☆56Dec 11, 2022Updated 3 years ago
- [ICASSP'23] This repo contains code for the Demux & MEmo emotion recognition models (https://arxiv.org/abs/2210.15842), as well as code t…☆23Jan 18, 2024Updated 2 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- Quantize transformers to any learned arbitrary 4-bit numeric format☆53Jan 25, 2026Updated 2 months ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆36Jan 17, 2024Updated 2 years ago