Code for the cocktail-party problem of isolating and enhancing the speech for the target speaker
☆18Mar 11, 2022Updated 4 years ago
Alternatives and similar repositories for speaker-separation
Users that are interested in speaker-separation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆21Aug 7, 2021Updated 4 years ago
- Bone/Air conducted speech signal enhancement exploiting multi-modal framework☆16Oct 15, 2020Updated 5 years ago
- INCREASING COMPACTNESS OF DEEP LEARNING BASED SPEECH ENHANCEMENT MODELS WITH PARAMETER PRUNING AND QUANTIZATION TECHNIQUES☆15Oct 18, 2019Updated 6 years ago
- Digital Notes of the 5th Summer School on Artificial Intelligence by CVIT, IIITH☆23Sep 4, 2021Updated 4 years ago
- ☆15Nov 7, 2020Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Code for "Weakly-supervised Fingerspelling Recognition in British Sign Language Videos", BMVC 2022.☆12Jun 22, 2023Updated 2 years ago
- Towards a Unified View on Visual Parameter-Efficient Transfer Learning☆26Oct 13, 2022Updated 3 years ago
- Visual Speech Recognition For Low-Resource Languages with Automatic Labels (ICASSP 2024)☆16Mar 17, 2025Updated last year
- Audio-visual diarization pipeline used for creating VoxConverse dataset☆21Jun 6, 2025Updated 9 months ago
- Applying Deep Reinforcement Learning for dialogue generation. aka chatbot☆13Apr 30, 2017Updated 8 years ago
- Langchain_CrewAI_Gemini - An Gemini AI powered AI Agent (Multi-Agent) Project.☆13Mar 24, 2024Updated 2 years ago
- offical code for Dense-TSNet☆12Sep 17, 2024Updated last year
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".☆12Oct 25, 2021Updated 4 years ago
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…☆11Jun 28, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- PAGAN: a phase-adapted GAN for speech enhancement☆36Sep 17, 2020Updated 5 years ago
- ☆15Feb 27, 2026Updated last month
- Official implementation of Transpotter, published in BMVC 2021☆16Aug 6, 2022Updated 3 years ago
- Cardiovascular Disease Classification Employing Empirical Mode Decomposition (EMD) of Modified ECG☆12Oct 6, 2023Updated 2 years ago
- ☆46Sep 13, 2020Updated 5 years ago
- Code implementation for our ICPR, 2020 paper titled "Improving Word Recognition using Multiple Hypotheses and Deep Embeddings"☆21May 21, 2021Updated 4 years ago
- ☆10Feb 13, 2025Updated last year
- Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision"☆29Apr 16, 2024Updated last year
- An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement☆14Nov 19, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆11Jun 14, 2024Updated last year
- ☆13Jun 24, 2021Updated 4 years ago
- A simple AI/ML tool for non-technical creatives☆11May 5, 2023Updated 2 years ago
- Implementation of "Knowing your FATE: Friendship, Action and Temporal Explanations for User Engagement Prediction on Social Apps"☆12Feb 21, 2020Updated 6 years ago
- ☆16Jun 15, 2022Updated 3 years ago
- The PyTorch code for paper: An Affect-Rich Neural Conversational Model with Biased Attention and Weighted Cross-Entropy Loss☆12Oct 7, 2019Updated 6 years ago
- Code for "A Data-Centric Approach To Generate Faithful and High Quality Patient Summaries with Large Language Models"☆17Jul 20, 2025Updated 8 months ago
- Parse Symcat (http://www.symcat.com) symptoms and conditions and generate valid Synthea (https://github.com/synthetichealth/synthea) modu…☆16Jan 28, 2021Updated 5 years ago
- Hospital simulator with pedestrians and robot☆15Oct 20, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Speaker Diarization using GRU in PyTorch☆11Aug 29, 2020Updated 5 years ago
- ☆135Oct 25, 2021Updated 4 years ago
- An audio filter bank implementation in Python, contains ERB and linear filter banks☆59May 4, 2018Updated 7 years ago
- [WIP]Direction based Multi-Channel Speech Separation☆14Jan 25, 2024Updated 2 years ago
- Applied Reinforcement Learning course☆12Feb 14, 2023Updated 3 years ago
- Audio samples for the paper 'Phase-aware music super-resolution using generative adversarial networks'☆14May 15, 2020Updated 5 years ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆13Jul 22, 2024Updated last year