muhammad-ahmed-ghani / svoice_demo
A PyTorch demo of the paper "Voice Separation with an Unknown Number of Multiple Speakers", built with Gradio and the NVIDIA NeMo ASR model.
☆36 Updated last year
Alternatives and similar repositories for svoice_demo
Users who are interested in svoice_demo are comparing it to the libraries listed below.
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ☆153 Updated last year
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram ☆265 Updated last year
- TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion ☆148 Updated last year
- General Speech Restoration ☆283 Updated last year
- Your one-stop solution for voice dataset creation ☆128 Updated 2 years ago
- A deep neural network architecture for low-latency audio processing ☆321 Updated 2 years ago
- 🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation ☆260 Updated last month
- Google's SoundStorm: Efficient Parallel Audio Generation ☆131 Updated 2 years ago
- ☆130 Updated 2 years ago
- This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using… ☆100 Updated last year
- Barkify: an unofficial training implementation of Bark TTS by suno-ai ☆128 Updated 2 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆119 Updated 2 years ago
- Python forced alignment ☆94 Updated last year
- Official Implementation of StyleTTS-VC ☆193 Updated 11 months ago
- Desktop application for neural speech synthesis written in C++ ☆213 Updated 2 years ago
- Putting flows on top of neural transducers for better TTS ☆64 Updated 2 weeks ago
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment) ☆124 Updated 3 years ago
- Speaker change detection using SincNet and an LSTM/Transformer ☆56 Updated 7 months ago
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP 2023. ☆177 Updated last year
- Demo for 2022 ICASSP ☆64 Updated 3 years ago
- iSTFTNet: Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform ☆268 Updated 5 months ago
- Finally, some decent sample sentences ☆23 Updated 2 years ago
- demo page https://MingjieChen.github.io/dygan-vc ☆67 Updated 3 years ago
- Create training data for training a voice cloner for bark text to speech. ☆48 Updated 2 years ago
- Toolbox for easy and qualitative one-shot voice conversion ☆46 Updated 4 years ago
- A simple voice conversion tool ☆19 Updated 3 years ago
- [WIP] VoiceSmith makes training text to speech models easy. ☆228 Updated 3 years ago
- A simple Python wrapper for audio noise reduction RNNoise. Simplifies work with it, adds new trained models and detailed instructions for… ☆179 Updated last year
- Community framework for training tortoise ☆44 Updated 3 years ago
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO ☆67 Updated 3 years ago