muhammad-ahmed-ghani / svoice_demo
A PyTorch demo of the paper Voice Separation with an Unknown Number of Multiple Speakers using gradio and Nvidia NEMO ASR model.
☆36Updated last year
Alternatives and similar repositories for svoice_demo:
Users that are interested in svoice_demo are comparing it to the libraries listed below
- This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using…☆91Updated 3 months ago
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)☆120Updated 2 years ago
- SelfRemaster: SSL Speech Restoration☆88Updated last year
- Zero-Shot Emotion Style Transfer☆43Updated 11 months ago
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆166Updated last year
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆62Updated 2 years ago
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"☆132Updated last year
- Official Implementation of StyleTTS-VC☆177Updated 2 months ago
- TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion☆145Updated last year
- Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion☆144Updated last year
- Unsupervised Rhythm Modeling for Voice Conversion☆80Updated last year
- Official Pytorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Pr…☆214Updated 8 months ago
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆82Updated 2 months ago
- Putting flows on top of neural transducers for better TTS☆62Updated 3 weeks ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆51Updated last year
- A simple voice conversion tool☆17Updated 3 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆48Updated 8 months ago
- TransferTTS (Zero-Shot learning of VITS)☆95Updated 2 years ago
- VoiceBox neural network implementation☆105Updated 7 months ago
- A sequence-to-sequence voice conversion toolkit.☆96Updated 8 months ago
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆85Updated 11 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆68Updated last year
- End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions☆91Updated last year
- VoiceLDM: Text-to-Speech with Environmental Context☆172Updated 7 months ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆122Updated 2 years ago
- Demo for 2022 ICASSP☆64Updated 2 years ago
- Application of MB-iSTFT-VITS components to vits2_pytorch☆124Updated 4 months ago
- ☆64Updated 6 months ago
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆84Updated 2 years ago
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆155Updated 2 years ago