Sindhu-Hegde / speaker-separation
Code for the cocktail-party problem of isolating and enhancing the speech of the target speaker
☆17 Updated 3 years ago
Alternatives and similar repositories for speaker-separation:
Users who are interested in speaker-separation are comparing it to the repositories listed below
- Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021 ☆105 Updated 10 months ago
- ☆21 Updated 3 years ago
- Speech Enhancement: TensorFlow 2.x implementation of the stacked dual-signal transformation LSTM network (DTLN) for Noise Suppression. ☆16 Updated 3 years ago
- Time series course Fall 2019 project ☆54 Updated 4 years ago
- CycleGAN-based Emotion Style Transfer as Data Augmentation for Speech Emotion Recognition ☆12 Updated 5 years ago
- Speaker diarization for the Hindi language. ☆49 Updated 4 years ago
- Unofficial PyTorch Implementation of WaveGrad2 ☆112 Updated 3 years ago
- Emotion recognition library for PyTorch ☆21 Updated 4 years ago
- Generated Audio Samples by ALGAN-VC model are available in the folder ☆17 Updated 3 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments ☆107 Updated last year
- Contains links to publicly available datasets for modeling health outcomes using speech and language. ☆118 Updated 9 months ago
- [NeurIPS 2020] Official repository for the project "Listening to Sound of Silence for Speech Denoising" ☆55 Updated 3 years ago
- Text to Speech for Indic languages ☆50 Updated 3 years ago
- Real-time Speech Separation, Noise Suppression & Speaker Recognition ☆18 Updated 5 years ago
- A PyTorch Implementation of MelGAN ☆67 Updated 5 years ago
- Small experiments with positional encoding ☆19 Updated 5 years ago
- Repository containing an experimentation platform for training and running inference on wav2vec2 models. ☆86 Updated 2 years ago
- 📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes). ☆101 Updated last year
- ☆18 Updated 2 years ago
- Enhancement of Audio Quality (Bit-Depth and Sampling-Rate) using Deep Learning. ☆33 Updated 5 years ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2 ☆91 Updated 3 years ago
- Baseline Recipe for VoicePrivacy Challenge 2020: https://www.voiceprivacychallenge.org/vp2020/docs/VoicePrivacy_2020_Eval_Plan_v1_3.pdf ☆62 Updated last year
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver… ☆89 Updated 4 years ago
- SEGAN PyTorch implementation https://arxiv.org/abs/1703.09452 ☆108 Updated 6 years ago
- Emotional Speech Conversion using Nonparallel Data ☆16 Updated 5 years ago
- ☆47 Updated 4 years ago
- This is the implementation of the Speaker Odyssey 2020 paper "Transforming spectrum and prosody for emotional voice conversion with non-… ☆124 Updated 4 years ago
- Feature extractor for DL speech processing. ☆65 Updated 2 years ago
- Voice Conversion using CycleGANs for Non-Parallel Data ☆121 Updated 6 years ago
- Lightweight neural speaker embedding extraction based on Kaldi and PyTorch. ☆136 Updated 5 years ago