jadfegh/audiovision

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jadfegh/audiovision)

jadfegh / audiovision

Real-time Speech Separation, Noise Suppression & Speaker Recognition

☆17

Alternatives and similar repositories for audiovision

Users that are interested in audiovision are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

aispeech-lab / TinyWASE
View on GitHub
PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…
☆11Jun 28, 2021Updated 5 years ago
gemengtju / L-SpEx
View on GitHub
☆39Feb 23, 2022Updated 4 years ago
zexupan / avse_hybrid_loss
View on GitHub
☆16Jun 15, 2022Updated 4 years ago
yongxuUSTC / grnnbf
View on GitHub
Generalized RNN beamformer for speech separation
☆18Jan 11, 2022Updated 4 years ago
PoKoHA / Speech_Enhancement-DCCRN
View on GitHub
DCCRN: Deep Complex Convolution Recurrent Network
☆14Nov 26, 2021Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
aispeech-lab / WASE
View on GitHub
PyTorch implementation of WASE described in our ICASSP 2021: "Wase: Learning When to Attend for Speaker Extraction in Cocktail Party Envi…
☆27Jan 11, 2022Updated 4 years ago
yingtaoluo / Complex-Wavelet-Inception-GAN-Audio-Synthesis
View on GitHub
☆16Jan 20, 2021Updated 5 years ago
xuchenglin28 / speaker_extraction_SpEx
View on GitHub
multi-scale time domain speaker extraction
☆81Jun 7, 2021Updated 5 years ago
aispeech-lab / LiMuSE
View on GitHub
PyTorch implementation of LiMuSE
☆33Oct 11, 2022Updated 3 years ago
ShinoharaYuuyoru / NoiseReductionUsingGRU
View on GitHub
This is my graduation project in BIT. Title: Noise Reduction Using GRU.
☆32May 25, 2023Updated 3 years ago
vipchengrui / MASG
View on GitHub
microphone array speech generator (MASG) in room acoustic
☆39Jan 2, 2020Updated 6 years ago
BUTSpeechFIT / speakerbeam
View on GitHub
☆146Oct 25, 2021Updated 4 years ago
ertug / Weak_Class_Source_Separation
View on GitHub
Source code and audio demos for the paper "Audio Source Separation Using Variational Autoencoders and Weak Class Supervision"
☆11Jun 21, 2026Updated last month
fgnt / pb_chime5
View on GitHub
Speech enhancement system for the CHiME-5 dinner party scenario
☆111Feb 6, 2025Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
dtake1336 / ERNN-for-speech-enhancement
View on GitHub
☆38Jul 20, 2020Updated 6 years ago
amogh3892 / Environmental-sound-recognition-using-combination-of-spectrogram-and-acoustic-features
View on GitHub
Classification of environmental sounds using first order statistics and GLCM (Gray-Level Co-Occurrence Matrix ) features of a spectrogram…
☆25Jul 14, 2020Updated 6 years ago
kts707 / real-time-audio-denoiser
View on GitHub
A CNN-based audio denoiser
☆10May 2, 2021Updated 5 years ago
newjins-papa / android-rnnoise
View on GitHub
☆16Nov 17, 2020Updated 5 years ago
haoxiangsnr / SpEx
View on GitHub
Implementation of "SpEx: Multi-Scale Time Domain Speaker Extraction Network".
☆37Jul 19, 2020Updated 6 years ago
ws-choi / LASAFT-Net-v2
View on GitHub
A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"
☆33Apr 11, 2022Updated 4 years ago
Le-Xiaohuai-speech / SKIP-DPCRN
View on GitHub
☆52Jun 14, 2022Updated 4 years ago
srigalibe / CS231n-Python-NumPy-Tutorial
View on GitHub
Teaching materials for the Convolutional Neural Networks for Visual Recognition (http://cs231n.github.io/python-numpy-tutorial/) classes …
☆26Mar 13, 2019Updated 7 years ago
Aworselife / DPTBF
View on GitHub
☆17Sep 12, 2023Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
manojahi / Project-Search-A-Recommendation-system-for-Youtube-video-and-Amazon-Product-based-on-user-comments
View on GitHub
Project Search is a Recommendation system for Youtube videos and Amazon products.
☆11May 10, 2017Updated 9 years ago
Totoketchup / Adaptive-MultiSpeaker-Separation
View on GitHub
Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem
☆50Jul 7, 2018Updated 8 years ago
lili-0805 / MVAE
View on GitHub
Official PyTorch implementation of MVAE for audio source separation
☆43Dec 21, 2022Updated 3 years ago
leohuang2013 / pyannote-audio_overlapped-speech-detection_cpp
View on GitHub
C++ version of pyannote audio overlapped speech detection pipeline
☆13Feb 14, 2024Updated 2 years ago
AkojimaSLP / Neural-mask-estimation
View on GitHub
☆46Dec 5, 2019Updated 6 years ago
CODEJIN / Speaker_Embedding_Torch
View on GitHub
PyTorch based speaker embedding model
☆16Apr 13, 2024Updated 2 years ago
gpip / cBKTree
View on GitHub
bktree data structure with a Python interface for a CPP implementation
☆13Jan 11, 2017Updated 9 years ago
HuangZikang-TJU / Aug4TSE
View on GitHub
☆15Sep 16, 2024Updated last year
zyfu0000 / lameHelper
View on GitHub
A c++ wrapper for the LAME library that reduces conversion of PCM (*.wav) to mp3 and vice versa to just two lines of codes.
☆12Jan 8, 2015Updated 11 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
ZhaZhaFon / resource_speech
View on GitHub
语音算法相关资源汇总 Resource for Speech Processing || NEWS: official link of VoxCeleb fails recently and an external link is added for download
☆60Jul 24, 2022Updated 3 years ago
nsu-ai-team / noise_supression
View on GitHub
Python package for noise supression in audio based on DNN
☆22Mar 24, 2023Updated 3 years ago
ostris / batch-annotator
View on GitHub
A batch annotator to handle most of the preprocessors for Control Net
☆21Aug 20, 2024Updated last year
stdKonjac / DeepComplexCRN
View on GitHub
☆13Mar 22, 2021Updated 5 years ago
yuguochencuc / SF-Net
View on GitHub
The implementation of "Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement"
☆53Feb 16, 2023Updated 3 years ago
speechLabBcCuny / onssen
View on GitHub
An open-source speech separation and enhancement library
☆214May 13, 2020Updated 6 years ago
OlivierLDff / QtMacCMake
View on GitHub
💻 CMake function that wrap macdeployqt, deploy dmg and pkg.
☆11Jan 8, 2026Updated 6 months ago