facebookresearch / denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder ar…
☆1,637Updated last year
Related projects: ⓘ
- The PyTorch-based audio source separation toolkit for researchers☆2,224Updated 2 months ago
- This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.☆1,064Updated last month
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch☆1,541Updated 4 months ago
- Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.☆567Updated last year
- A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.☆1,812Updated this week
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆1,902Updated last month
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.☆1,563Updated 3 weeks ago
- Deep learning for audio denoising☆644Updated 11 months ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆658Updated 6 months ago
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.☆924Updated 2 weeks ago
- A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech…☆705Updated 3 years ago
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies☆1,264Updated 3 months ago
- We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers In which, we present a new …☆1,220Updated 10 months ago
- List of speech synthesis papers.☆989Updated last year
- Unofficial PyTorch implementation of Google AI's VoiceFilter system☆1,074Updated last month
- Tools for handling speech data in machine learning projects.☆932Updated this week
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR☆887Updated last year
- Audio processing by using pytorch 1D convolution network☆1,009Updated 7 months ago
- speech enhancement\speech seperation\sound source localization☆993Updated 10 months ago
- 🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).☆1,672Updated 3 months ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit☆2,212Updated last week
- A must-read paper for speech separation based on neural networks☆735Updated 2 years ago
- In defence of metric learning for speaker recognition☆1,027Updated 5 months ago
- 🐸 collection of TTS papers☆614Updated 2 months ago
- Noise reduction in python using spectral gating (speech, bioacoustics, audio, time-domain signals)☆1,412Updated 2 months ago
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone☆878Updated last year
- Python interface to the WebRTC Voice Activity Detector☆2,014Updated 2 months ago
- The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number…☆475Updated 2 months ago
- GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis☆959Updated last year
- An audio/acoustic activity detection and audio segmentation tool☆732Updated last year