facebookresearch / svoiceLinks
We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers In which, we present a new method for separating a mixed audio sequence, in which multiple voices speak simultaneously. The new method employs gated neural networks that are trained to separate the voices at multiple processing steps, wh…
☆1,297Updated last year
Alternatives and similar repositories for svoice
Users that are interested in svoice are comparing it to the libraries listed below
Sorting:
- Unofficial PyTorch implementation of Google AI's VoiceFilter system☆1,142Updated 10 months ago
- The PyTorch-based audio source separation toolkit for researchers☆2,402Updated 5 months ago
- ☆679Updated 8 months ago
- A must-read paper for speech separation based on neural networks☆784Updated last month
- Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech E…☆1,794Updated 2 years ago
- Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.☆626Updated last year
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.☆1,052Updated 5 months ago
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies☆1,336Updated last year
- This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.☆1,232Updated 10 months ago
- A library for speech data augmentation in time-domain☆664Updated 3 years ago
- Deep learning for audio denoising☆713Updated last year
- An open source dataset for source separation☆429Updated last year
- Audio processing by using pytorch 1D convolution network☆1,070Updated last month
- A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech…☆774Updated 4 years ago
- Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.☆511Updated 3 years ago
- The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech proc…☆368Updated this week
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch☆1,607Updated last year
- A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.☆2,065Updated 3 weeks ago
- Unsupervised Speech Decomposition Via Triple Information Bottleneck☆687Updated 8 months ago
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.☆1,763Updated 8 months ago
- Tools for handling multimodal data in machine learning projects.☆1,028Updated last week
- Large, modern dataset for speech recognition☆677Updated last year
- Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t…☆861Updated last year
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR☆973Updated last year
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆802Updated 6 months ago
- List of speech synthesis papers.☆1,045Updated last year
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition☆483Updated 3 years ago
- 🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.☆1,149Updated last year
- LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks…☆509Updated 3 years ago
- speech enhancement\speech seperation\sound source localization☆1,146Updated last year