Sindhu-Hegde / speaker-separationLinks
Code for the cocktail-party problem of isolating and enhancing the speech for the target speaker
☆17Updated 3 years ago
Alternatives and similar repositories for speaker-separation
Users that are interested in speaker-separation are comparing it to the libraries listed below
Sorting:
- Contains links to publicly available datasets for modeling health outcomes using speech and language.☆126Updated last year
- Data repository of Project Coswara☆198Updated 2 years ago
- Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021☆108Updated last year
- A collection of Audio and Speech pre-trained models.☆193Updated 5 years ago
- Speaker Diarization using GRU in PyTorch☆11Updated 5 years ago
- Time series course Fall 2019 project☆53Updated 5 years ago
- Classification of Urban Sound Audio Dataset using LSTM-based model.☆77Updated 3 years ago
- A neural attention model for speech command recognition☆186Updated 5 months ago
- Using Convolutional Neural Networks in speech emotion recognition on the RAVDESS Audio Dataset.☆144Updated 4 years ago
- SEGAN pytorch implementation https://arxiv.org/abs/1703.09452☆111Updated 6 years ago
- Voice Conversion using Cycle GAN's For Non-Parallel Data☆124Updated 7 years ago
- Sound Classification using Neural Networks☆50Updated 3 years ago
- PyTorch implementation of Sequence Transduction with Recurrent Neural Networks (RNN-T) speech recognition paper☆15Updated 3 years ago
- ☆98Updated 6 years ago
- ☆30Updated 3 years ago
- small experimentation about positional encoding☆19Updated 5 years ago
- Understanding emotions from audio files using neural networks and multiple datasets.☆425Updated 2 years ago
- CycleGAN-based Emotion Style Transfer as Data Augmentation for Speech Emotion Recognition☆12Updated 6 years ago
- Include some core functions and model to handle speech separation☆156Updated 4 years ago
- Generated Audio Samples by ALGAN-VC model are available in the folder☆19Updated 3 years ago
- Implementation of the paper "Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning" From I…☆57Updated 5 years ago
- An implementation of Conformer: Convolution-augmented Transformer for Speech Recognition, a Transformer Variant in TensorFlow/Keras☆45Updated 3 years ago
- Detecting emotions using MFCC features of human speech using Deep Learning☆131Updated 5 years ago
- Code and slides for the "Deep learning (audio) application: From design to deployment" tutorials.☆176Updated last year
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.☆136Updated 5 years ago
- INCREASING COMPACTNESS OF DEEP LEARNING BASED SPEECH ENHANCEMENT MODELS WITH PARAMETER PRUNING AND QUANTIZATION TECHNIQUES☆14Updated 6 years ago
- 📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).☆104Updated 2 years ago
- The official repository for Audio ALBERT☆67Updated 3 years ago
- Emotional Speech Conversion using Style Transfer and MUNIT☆37Updated 6 years ago
- TensorFlow Implementation of CDVAE-VC.☆54Updated 2 years ago