Sindhu-Hegde / speaker-separation
Code for the cocktail-party problem of isolating and enhancing the speech for the target speaker
☆17 Updated 3 years ago
Alternatives and similar repositories for speaker-separation
Users who are interested in speaker-separation are comparing it to the repositories listed below
- Contains links to publicly available datasets for modeling health outcomes using speech and language. ☆123 Updated last year
- Speaker Diarization using GRU in PyTorch ☆11 Updated 4 years ago
- Identifying people from small audio fragments ☆170 Updated 5 years ago
- Data repository of Project Coswara ☆189 Updated 2 years ago
- Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021 ☆108 Updated last year
- Time series course Fall 2019 project ☆53 Updated 4 years ago
- CycleGAN-based Emotion Style Transfer as Data Augmentation for Speech Emotion Recognition ☆12 Updated 5 years ago
- 📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes). ☆102 Updated last year
- Classification of Urban Sound Audio Dataset using LSTM-based model. ☆74 Updated 2 years ago
- A neural attention model for speech command recognition ☆185 Updated 2 weeks ago
- ☆137 Updated 10 months ago
- Voice Conversion using CycleGANs for Non-Parallel Data ☆122 Updated 6 years ago
- ☆30 Updated 2 years ago
- Urdu Language Speech Emotional Corpus ☆46 Updated 6 years ago
- Small experiment on positional encoding ☆19 Updated 5 years ago
- Includes some core functions and models for speech separation ☆155 Updated 4 years ago
- Understanding emotions from audio files using neural networks and multiple datasets. ☆420 Updated 2 years ago
- Repository containing experimentation platform on how to train, infer on wav2vec2 models. ☆87 Updated 2 years ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2 ☆90 Updated 3 years ago
- A collection of Audio and Speech pre-trained models. ☆192 Updated 5 years ago
- ☆90 Updated 2 years ago
- Increasing Compactness of Deep Learning Based Speech Enhancement Models with Parameter Pruning and Quantization Techniques ☆14 Updated 5 years ago
- Emotion recognition library for PyTorch ☆22 Updated 4 years ago
- SEGAN pytorch implementation https://arxiv.org/abs/1703.09452