A little useful toolbox for python.
☆77Mar 30, 2020Updated 6 years ago
Alternatives and similar repositories for soja-box
Users that are interested in soja-box are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Python interface to the WebRTC Noise Suppression☆19Dec 16, 2021Updated 4 years ago
- 利用webRTC对语音进行处理,实现VAD和降噪处理☆51Nov 13, 2018Updated 7 years ago
- Convolutional neural nets for single channel speech enhancement☆144Dec 15, 2020Updated 5 years ago
- ☆15Dec 7, 2022Updated 3 years ago
- deep learning based speech enhancement using keras or pytorch, make it easy to use☆339Feb 26, 2020Updated 6 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆15Jan 3, 2018Updated 8 years ago
- speaker recognition using keras☆36Nov 29, 2022Updated 3 years ago
- JAMS annotation files for the original and augmented UrbanSound8K dataset☆35Jan 31, 2018Updated 8 years ago
- Adapter-X: A Novel General Parameter-Efficient Fine-Tuning Framework for Vision☆11Jul 22, 2024Updated last year
- A tensorflow implementation of my paper Combining beamforming and deep neural networks for multi-channel speech extraction☆68Dec 15, 2020Updated 5 years ago
- Multi-hop Question Generation with Graph Convolutional Network☆30Nov 2, 2022Updated 3 years ago
- A toolkit to implement segmentation on speech based on BIC and nerual network, such as BiLSTM☆123Aug 7, 2019Updated 6 years ago
- ☆24Oct 12, 2018Updated 7 years ago
- Speech Recognition With Python | python语音识别☆20Jul 22, 2022Updated 3 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Mar 15, 2023Updated 3 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Dec 16, 2022Updated 3 years ago
- Data and Code for Paper "Reflect Not Reflex: Inference-Based Common Ground Improves Dialogue Response Quality" (EMNLP 2022)☆11Nov 28, 2022Updated 3 years ago
- Free, studio-quality reverb SOURCE CODE in the public domain☆10May 10, 2022Updated 3 years ago
- Pitch shifter using WSOLA and resampling implemented by Python3☆39Jul 19, 2017Updated 8 years ago
- Base on MFCC and GMM(基于MFCC和高斯混合模型的语音识别)☆255Mar 13, 2019Updated 7 years ago
- Separating Anything from Image in Context☆12May 29, 2024Updated last year
- Score Normalization for NIST 2019 Speaker Recognition Evaluation☆10Nov 8, 2019Updated 6 years ago
- Fast Music Indexing PoC based on Shazam☆34May 18, 2019Updated 6 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A Pytorch implementation of 'AUTOMATIC SPEECH EMOTION RECOGNITION USING RECURRENT NEURAL NETWORKS WITH LOCAL ATTENTION'☆41Aug 1, 2018Updated 7 years ago
- Gender/Race/Emotion classifications based on facial multi-attribute detection were realized through data pre-processing, face detection a…☆11Dec 31, 2018Updated 7 years ago
- DeepRec Extension is an easy-to-use, stable and efficient large-scale distributed training system based on DeepRec.☆13May 17, 2024Updated last year
- Scripts to prepare OXFORD VGG Face dataset☆12Mar 29, 2016Updated 10 years ago
- webrtc audio processing☆419May 10, 2020Updated 5 years ago
- A collection of trending speech enhancement papers☆11Dec 4, 2020Updated 5 years ago
- ☆12Jun 11, 2020Updated 5 years ago
- Speaker embedding(verification and recognition) using Pytorch☆369Jul 24, 2020Updated 5 years ago
- Speech enhancement system for the CHiME-5 dinner party scenario☆109Feb 6, 2025Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- SoundPy (alpha stage) is a research-based python package for speech and sound. Applications include deep-learning, filtering, speech-enha…☆77Jan 19, 2025Updated last year
- ☆11Nov 11, 2021Updated 4 years ago
- [CVPR2024] Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation☆19Sep 3, 2024Updated last year
- ☆13Sep 6, 2022Updated 3 years ago
- [ICLR 2025] "Noisy Test-Time Adaptation in Vision-Language Models"☆12Feb 22, 2025Updated last year
- Python bindings of WebRTC Audio Processing☆213May 7, 2025Updated 11 months ago
- ☆11May 26, 2020Updated 5 years ago