Real-time Speech Separation, Noise Suppression & Speaker Recognition
☆18 · Apr 17, 2019 · Updated 6 years ago
Alternatives and similar repositories for audiovision
Users that are interested in audiovision are comparing it to the libraries listed below.
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and… ☆11 · Jun 28, 2021 · Updated 4 years ago
- ☆15 · Jun 15, 2022 · Updated 3 years ago
- ☆37 · Feb 23, 2022 · Updated 4 years ago
- DCCRN: Deep Complex Convolution Recurrent Network ☆13 · Nov 26, 2021 · Updated 4 years ago
- Generalized RNN beamformer for speech separation ☆18 · Jan 11, 2022 · Updated 4 years ago
- PyTorch implementation of WASE described in our ICASSP 2021: "Wase: Learning When to Attend for Speaker Extraction in Cocktail Party Envi… ☆27 · Jan 11, 2022 · Updated 4 years ago
- microphone array speech generator (MASG) in room acoustic ☆39 · Jan 2, 2020 · Updated 6 years ago
- This is my graduation project in BIT. Title: Noise Reduction Using GRU. ☆31 · May 25, 2023 · Updated 2 years ago
- ☆16 · Jan 20, 2021 · Updated 5 years ago
- multi-scale time domain speaker extraction ☆73 · Jun 7, 2021 · Updated 4 years ago
- PyTorch implementation of LiMuSE ☆32 · Oct 11, 2022 · Updated 3 years ago
- Sound field estimation based on physics-constrained neural kernel ☆21 · Jun 9, 2025 · Updated 9 months ago
- This repository contains supplementary material for the paper: "Audio Source Separation Using Variational Autoencoders and Weak Class Sup… ☆11 · Jan 10, 2023 · Updated 3 years ago
- Hybrid GAN (HiFi-WaveGAN) applied to footsteps sound effects ☆12 · Jul 17, 2023 · Updated 2 years ago
- Documentation source for docs.drawthings.ai ☆20 · Apr 26, 2024 · Updated last year
- Keras implementation of conditional waveGAN. Application to knocking sound effects with emotion. ☆11 · Jun 22, 2020 · Updated 5 years ago
- A batch annotator to handle most of the preprocessors for Control Net ☆20 · Aug 20, 2024 · Updated last year
- ☆38 · Jul 20, 2020 · Updated 5 years ago
- Classification of environmental sounds using first order statistics and GLCM (Gray-Level Co-Occurrence Matrix) features of a spectrogram… ☆25 · Jul 14, 2020 · Updated 5 years ago
- A CNN-based audio denoiser ☆10 · May 2, 2021 · Updated 4 years ago
- CS230 Final Project - Audio Super Resolution ☆13 · Jun 18, 2018 · Updated 7 years ago
- An open-source speech separation and enhancement library ☆214 · May 13, 2020 · Updated 5 years ago
- Extends Model Context Protocol (MCP) to local LLMs via Ollama, enabling Claude-like tool use (files, web, email, GitHub, AI images) while… ☆25 · Jun 17, 2025 · Updated 9 months ago
- Implementation of "SpEx: Multi-Scale Time Domain Speaker Extraction Network". ☆37 · Jul 19, 2020 · Updated 5 years ago
- RLVR Testing and Training ☆23 · Aug 28, 2025 · Updated 6 months ago
- ☆16 · Nov 17, 2020 · Updated 5 years ago
- Audio source separation (mixture to vocal) using the Wavenet ☆21 · Sep 6, 2017 · Updated 8 years ago
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation" ☆33 · Apr 11, 2022 · Updated 3 years ago
- C++ version of pyannote audio overlapped speech detection pipeline ☆13 · Feb 14, 2024 · Updated 2 years ago
- PyTorch based speaker embedding model ☆16 · Apr 13, 2024 · Updated last year
- ☆16 · Sep 12, 2023 · Updated 2 years ago
- Consistent dictionary learning algorithm for signal declipping (Python code) ☆20 · Oct 24, 2018 · Updated 7 years ago
- Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem ☆51 · Jul 7, 2018 · Updated 7 years ago
- Official PyTorch implementation of MVAE for audio source separation ☆43 · Dec 21, 2022 · Updated 3 years ago
- A pure-Python, bring-your-own-I/O implementation of HTTP/1.1 ☆13 · Oct 30, 2018 · Updated 7 years ago
- This repository is webrtc agc module demo. ☆12 · Jan 23, 2019 · Updated 7 years ago
- ☆51 · Jun 14, 2022 · Updated 3 years ago
- ☆45 · Dec 5, 2019 · Updated 6 years ago
- MongoDB with Pymongo Tutorial ☆10 · Apr 19, 2024 · Updated last year