A small, useful toolbox for Python.
☆77Mar 30, 2020Updated 5 years ago
Alternatives and similar repositories for soja-box
Users that are interested in soja-box are comparing it to the libraries listed below
- Python interface to the WebRTC Noise Suppression☆19Dec 16, 2021Updated 4 years ago
- Speech processing with WebRTC, implementing VAD and noise reduction☆51Nov 13, 2018Updated 7 years ago
- LogMMSE speech enhancement/noise reduction☆90Apr 1, 2020Updated 5 years ago
- Convolutional neural nets for single channel speech enhancement☆144Dec 15, 2020Updated 5 years ago
- Deep-learning-based speech enhancement using Keras or PyTorch, made easy to use☆339Feb 26, 2020Updated 6 years ago
- Speech Enhancement using Bayesian WaveNet☆98Apr 1, 2018Updated 7 years ago
- speaker recognition using keras☆36Nov 29, 2022Updated 3 years ago
- music semantic understanding evaluation benchmark☆25Aug 12, 2023Updated 2 years ago
- JAMS annotation files for the original and augmented UrbanSound8K dataset☆35Jan 31, 2018Updated 8 years ago
- A TensorFlow implementation of my paper "Combining beamforming and deep neural networks for multi-channel speech extraction"☆68Dec 15, 2020Updated 5 years ago
- A toolkit for speech segmentation based on BIC and neural networks, such as BiLSTM☆123Aug 7, 2019Updated 6 years ago
- ☆24Oct 12, 2018Updated 7 years ago
- Speech Recognition With Python☆20Jul 22, 2022Updated 3 years ago
- Custom Android screensaver, video screensaver☆10Jul 4, 2018Updated 7 years ago
- Repository for reproducing the results of the journal paper "Self-supervised learning for Speech Emotion Recognition"☆10Mar 15, 2023Updated 3 years ago
- Free, studio-quality reverb SOURCE CODE in the public domain☆10May 10, 2022Updated 3 years ago
- Pitch shifter using WSOLA and resampling, implemented in Python 3☆39Jul 19, 2017Updated 8 years ago
- Speech recognition based on MFCC and GMM (Gaussian mixture models)☆255Mar 13, 2019Updated 7 years ago
- Spoken Language Identification from Short Utterances☆13Jul 6, 2022Updated 3 years ago
- Create reliability diagrams to quantify ML calibration.☆10Feb 1, 2022Updated 4 years ago
- A toolkit for researchers in the multimodal sound separation.☆16Oct 20, 2023Updated 2 years ago
- App client for the Elasticsearch Chinese community☆13May 5, 2016Updated 9 years ago
- Gender/Race/Emotion classifications based on facial multi-attribute detection were realized through data pre-processing, face detection a…☆11Dec 31, 2018Updated 7 years ago
- DeepRec Extension is an easy-to-use, stable and efficient large-scale distributed training system based on DeepRec.☆12May 17, 2024Updated last year
- webrtc audio processing☆418May 10, 2020Updated 5 years ago
- Scripts to prepare OXFORD VGG Face dataset☆12Mar 29, 2016Updated 9 years ago
- Generation tool for offset-resistant audio adversarial examples against Deepspeech☆10Oct 5, 2020Updated 5 years ago
- Speaker embedding (verification and recognition) using PyTorch☆369Jul 24, 2020Updated 5 years ago
- A collection of trending speech enhancement papers☆11Dec 4, 2020Updated 5 years ago
- ☆12Jun 11, 2020Updated 5 years ago
- snowboy setup on raspberry pi☆16Feb 21, 2018Updated 8 years ago
- AAAI2025☆11Apr 18, 2025Updated 11 months ago
- Speech enhancement system for the CHiME-5 dinner party scenario☆109Feb 6, 2025Updated last year
- SoundPy (alpha stage) is a research-based python package for speech and sound. Applications include deep-learning, filtering, speech-enha…☆77Jan 19, 2025Updated last year
- Use openai whisper to transcribe your voice into written text completely locally in one command☆11Dec 7, 2023Updated 2 years ago
- ☆11Nov 11, 2021Updated 4 years ago
- This is the code and dataset for our paper [Modeling Attention and Memory for Auditory Selection in a Cocktail Party Environment. AAAI 2018]☆57Apr 12, 2018Updated 7 years ago
- [ICLR 2025] "Noisy Test-Time Adaptation in Vision-Language Models"☆12Feb 22, 2025Updated last year
- Python bindings of WebRTC Audio Processing☆213May 7, 2025Updated 10 months ago