Sindhu-Hegde / speaker-separation
Code for the cocktail-party problem of isolating and enhancing the speech for the target speaker
☆17Updated 3 years ago
Alternatives and similar repositories for speaker-separation
Users that are interested in speaker-separation are comparing it to the libraries listed below
Sorting:
- Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021☆107Updated 11 months ago
- ☆21Updated 3 years ago
- Data repository of Project Coswara☆187Updated last year
- Time series course Fall 2019 project☆54Updated 4 years ago
- Audio Classification using Image Classification☆48Updated 5 years ago
- Speaker Diarization using GRU in PyTorch☆11Updated 4 years ago
- Speech Enhancement: Tensorflow 2.x implementation of the stacked dual-signal transformation LSTM network (DTLN) for Noise Suppression.☆16Updated 4 years ago
- Implementation of the paper "Speech emotion recognition with deep convolutional neural networks" by Dias Issa Et al.☆12Updated 3 years ago
- Emotional Speech Conversion using Style Transfer and MUNIT☆33Updated 6 years ago
- Contains links to publicly available datasets for modeling health outcomes using speech and language.☆121Updated 11 months ago
- Predicting various emotion in human speech signal by detecting different speech components affected by human emotion.☆46Updated 9 months ago
- COVID-19 Coughs files for training AI models☆41Updated 4 years ago
- Collection of research papers on cough classification☆39Updated 5 years ago
- This project is about performing Speaker diarization for Hindi Language.☆49Updated 4 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆107Updated last year
- Real-time Speech Separation, Noise Suppression & Speaker Recognition☆18Updated 6 years ago
- Sound Classification using Neural Networks☆50Updated 2 years ago
- 📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).☆103Updated last year
- Built a deep neural network that functions as part of an end-to-end automatic speech recognition (ASR) pipeline.☆48Updated 6 years ago
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.☆87Updated 2 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…☆65Updated 4 years ago
- Implement a GRU/LSTM model using Keras, and train it to classify the languages using MFCC features☆26Updated 9 months ago
- Urban sound source tagging from an aggregation of four second noisy audio clips via 1D and 2D CNN (Xception)☆60Updated 2 years ago
- Text to Speech for Indic languages☆50Updated 3 years ago
- This repository contains the code related to the paper 'DENet: a deep architecture for audio surveillance applications'.☆42Updated last year
- Detecting emotions using MFCC features of human speech using Deep Learning☆128Updated 4 years ago
- Using Convolutional Neural Networks in speech emotion recognition on the RAVDESS Audio Dataset.☆137Updated 4 years ago
- ☆10Updated 5 years ago
- ☆131Updated 8 months ago
- Implementation of the paper "Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning" From I…☆57Updated 4 years ago