Sindhu-Hegde / speaker-separation
Code for the cocktail-party problem of isolating and enhancing the speech for the target speaker
☆17 Updated 2 years ago
Related projects
Alternatives and complementary repositories for speaker-separation
- Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021 ☆103 Updated 5 months ago
- Code implementation for our ICPR, 2020 paper titled "Improving Word Recognition using Multiple Hypotheses and Deep Embeddings" ☆21 Updated 3 years ago
- CycleGAN-based Emotion Style Transfer as Data Augmentation for Speech Emotion Recognition ☆12 Updated 5 years ago
- Speech Enhancement: Tensorflow 2.x implementation of the stacked dual-signal transformation LSTM network (DTLN) for Noise Suppression. ☆14 Updated 3 years ago
- Speaker Diarization using GRU in PyTorch ☆11 Updated 4 years ago
- Experiment with "one-shot learning" techniques to recognize a voice signature ☆24 Updated 4 years ago
- Time series course Fall 2019 project ☆53 Updated 4 years ago
- Code and models for evaluating a state-of-the-art lip reading network ☆189 Updated last year
- Identify the emotion of multiple speakers in an Audio Segment ☆163 Updated last year
- Contains links to publicly available datasets for modeling health outcomes using speech and language. ☆115 Updated 5 months ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2 ☆89 Updated 2 years ago
- Include some core functions and model to handle speech separation ☆154 Updated 3 years ago
- This project is about performing Speaker diarization for Hindi Language. ☆45 Updated 3 years ago
- Detecting emotions using MFCC features of human speech using Deep Learning ☆125 Updated 3 years ago
- Implementation of the paper "Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning" From I… ☆58 Updated 3 years ago
- small experimentation about positional encoding ☆17 Updated 4 years ago
- Urban Sound Classification : striving towards a fair comparison ☆16 Updated 3 years ago
- ☆24 Updated 5 years ago
- End-to-End Speech Recognition using Neural Networks. ☆35 Updated 2 months ago
- Emotional Speech Conversion using Style Transfer and MUNIT ☆33 Updated 5 years ago
- Using Convolutional Neural Networks in speech emotion recognition on the RAVDESS Audio Dataset. ☆137 Updated 3 years ago
- Urban sound source tagging from an aggregation of four second noisy audio clips via 1D and 2D CNN (Xception) ☆58 Updated last year
- Code implementation for our DAS, 2020 paper titled "Fused Text Recogniser and Deep Embeddings Improve Word Recognition and Retrieval" ☆14 Updated 3 months ago
- Detecting emotion in voices ☆46 Updated 5 years ago
- ☆90 Updated last year
- Built a deep neural network that functions as part of an end-to-end automatic speech recognition (ASR) pipeline. ☆48 Updated 5 years ago
- Predicting various emotion in human speech signal by detecting different speech components affected by human emotion. ☆42 Updated 3 months ago
- Use machine learning models to detect lies based solely on acoustic speech information ☆50 Updated 5 years ago
- Identifying people from small audio fragments ☆169 Updated 4 years ago
- speaker_separation ☆14 Updated last year