Sindhu-Hegde / speaker-separation
Code for the cocktail-party problem of isolating and enhancing the speech for the target speaker
☆17Updated 3 years ago
Alternatives and similar repositories for speaker-separation
Users who are interested in speaker-separation are comparing it to the repositories listed below
- Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021☆107Updated last year
- Classification of Urban Sound Audio Dataset using LSTM-based model.☆74Updated 2 years ago
- Contains links to publicly available datasets for modeling health outcomes using speech and language.☆121Updated 11 months ago
- Time series course Fall 2019 project☆53Updated 4 years ago
- ☆45Updated 7 years ago
- Speaker Diarization using GRU in PyTorch☆11Updated 4 years ago
- Real-time Speech Separation, Noise Suppression & Speaker Recognition☆18Updated 6 years ago
- The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)☆37Updated 2 years ago
- Utils and data sets for audio and PyTorch☆85Updated 3 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…☆65Updated 4 years ago
- Code and slides for the "Deep learning (audio) application: From design to deployment" tutorials.☆175Updated last year
- Inspired work by the project of SER using ELM at Microsoft Research☆19Updated 6 years ago
- small experimentation about positional encoding☆19Updated 5 years ago
- 1st Place Public Leaderboard Solution for ERC2019☆70Updated 5 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆108Updated last year
- Implementation of the paper "Speech emotion recognition with deep convolutional neural networks" by Dias Issa et al.☆13Updated 3 years ago
- 📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).☆103Updated last year
- An implementation of Conformer: Convolution-augmented Transformer for Speech Recognition, a Transformer Variant in TensorFlow/Keras☆43Updated 3 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆90Updated 4 years ago
- End-to-End Speech Recognition using Neural Networks.☆35Updated 9 months ago
- Lip Reading in the Wild using ResNet and LSTMs in PyTorch☆58Updated 7 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆65Updated 4 years ago
- Implementation of the paper "Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning" From I…☆57Updated 4 years ago
- Using Convolutional Neural Networks in speech emotion recognition on the RAVDESS Audio Dataset.☆138Updated 4 years ago
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.☆87Updated 2 years ago
- Accent Classification in Speech☆25Updated 5 years ago
- Data repository of Project Coswara☆187Updated last year
- ☆24Updated 6 years ago
- This project is about performing Speaker diarization for Hindi Language.☆50Updated 4 years ago
- A fully convolutional network for speech-to-text, built on PyTorch.☆126Updated 5 years ago