Sindhu-Hegde / speaker-separationLinks
Code for the cocktail-party problem of isolating and enhancing the speech for the target speaker
☆17Updated 3 years ago
Alternatives and similar repositories for speaker-separation
Users that are interested in speaker-separation are comparing it to the libraries listed below
Sorting:
- Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021☆108Updated last year
- Contains links to publicly available datasets for modeling health outcomes using speech and language.☆125Updated last year
- Identifying people from small audio fragments☆170Updated 5 years ago
- Speaker Diarization using GRU in PyTorch☆11Updated 5 years ago
- Time series course Fall 2019 project☆53Updated 5 years ago
- CycleGAN-based Emotion Style Transfer as Data Augmentation for Speech Emotion Recognition☆12Updated 6 years ago
- Code implementation for our ICPR, 2020 paper titled "Improving Word Recognition using Multiple Hypotheses and Deep Embeddings"☆21Updated 4 years ago
- ☆90Updated 2 years ago
- An implementation of Conformer: Convolution-augmented Transformer for Speech Recognition, a Transformer Variant in TensorFlow/Keras☆45Updated 3 years ago
- Understanding emotions from audio files using neural networks and multiple datasets.☆422Updated 2 years ago
- Data repository of Project Coswara☆194Updated 2 years ago
- Classification of Urban Sound Audio Dataset using LSTM-based model.☆76Updated 3 years ago
- Text to Speech with PyTorch (English and Mongolian)☆184Updated last year
- Using Convolutional Neural Networks in speech emotion recognition on the RAVDESS Audio Dataset.☆141Updated 4 years ago
- A collection of Audio and Speech pre-trained models.☆194Updated 5 years ago
- small experimentation about positional encoding☆19Updated 5 years ago
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.☆88Updated 3 years ago
- Identify the emotion of multiple speakers in an Audio Segment☆178Updated 2 years ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2☆90Updated 3 years ago
- Detecting emotions using MFCC features of human speech using Deep Learning☆130Updated 4 years ago
- Code and slides for the "Deep learning (audio) application: From design to deployment" tutorials.☆175Updated last year
- Implementation of the paper "Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning" From I…☆57Updated 4 years ago
- A fully convolution-network for speech-to-text, built on pytorch.☆126Updated 5 years ago
- Implementing VGGVox for Speaker Identification on VoxCeleb1 dataset in PyTorch.☆25Updated 5 years ago
- Implementation of the paper "Speech emotion recognition with deep convolutional neural networks" by Dias Issa Et al.☆13Updated 3 years ago
- ☆138Updated last year
- Emotional Speech Conversion using Style Transfer and MUNIT☆36Updated 6 years ago
- Include some core functions and model to handle speech separation☆155Updated 4 years ago
- Speaker Identification System (upto 100% accuracy); built using Python 2.7 and python_speech_features library☆211Updated 5 years ago
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow☆129Updated 4 years ago