Sindhu-Hegde / speaker-separation
Code for the cocktail-party problem of isolating and enhancing the speech of the target speaker
☆17Updated 2 years ago
Alternatives and similar repositories for speaker-separation:
Users who are interested in speaker-separation are comparing it to the repositories listed below
- Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021☆104Updated 8 months ago
- Speaker Diarization using GRU in PyTorch☆11Updated 4 years ago
- Detecting emotions using MFCC features of human speech using Deep Learning☆125Updated 4 years ago
- Contains links to publicly available datasets for modeling health outcomes using speech and language.☆116Updated 7 months ago
- Data repository of Project Coswara☆182Updated last year
- Sound Classification using Neural Networks☆49Updated 2 years ago
- Generated Audio Samples by ALGAN-VC model are available in the folder☆16Updated 2 years ago
- Urban sound source tagging from an aggregation of four second noisy audio clips via 1D and 2D CNN (Xception)☆58Updated last year
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆89Updated 3 years ago
- Speech Enhancement: Tensorflow 2.x implementation of the stacked dual-signal transformation LSTM network (DTLN) for Noise Suppression.☆16Updated 3 years ago
- CycleGAN-based Emotion Style Transfer as Data Augmentation for Speech Emotion Recognition☆12Updated 5 years ago
- This project is about performing Speaker diarization for Hindi Language.☆48Updated 3 years ago
- Using Convolutional Neural Networks in speech emotion recognition on the RAVDESS Audio Dataset.☆138Updated 3 years ago
- Speech Denoising using RNNs in Tensorflow☆23Updated 6 years ago
- Urban Sound Classification : striving towards a fair comparison☆17Updated 4 years ago
- Urban sounds classification with Convolutional Neural Networks☆36Updated 5 years ago
- A fully convolutional network for speech-to-text, built on PyTorch.☆126Updated 4 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…☆65Updated 4 years ago
- SEGAN pytorch implementation https://arxiv.org/abs/1703.09452☆107Updated 5 years ago
- MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement (ICML 2019, with Travel awar…☆136Updated 3 years ago
- 📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).☆29Updated 7 months ago
- Text to Speech for Indic languages☆50Updated 2 years ago
- Multi class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to id…☆66Updated 4 years ago
- Includes some core functions and models to handle speech separation☆155Updated 3 years ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2☆90Updated 3 years ago
- Augmented Audio Data Generator for 1D-Convolutional Neural Networks☆49Updated 3 years ago
- An implementation of Conformer: Convolution-augmented Transformer for Speech Recognition, a Transformer Variant in TensorFlow/Keras☆42Updated 3 years ago