A small, useful toolbox for Python.
☆77Mar 30, 2020Updated 6 years ago
Alternatives and similar repositories for soja-box
Users that are interested in soja-box are comparing it to the libraries listed below.
- Python interface to the WebRTC Noise Suppression☆18Dec 16, 2021Updated 4 years ago
- Speech processing with WebRTC, implementing VAD and noise reduction☆49Nov 13, 2018Updated 7 years ago
- LogMMSE speech enhancement/noise reduction☆89Apr 1, 2020Updated 6 years ago
- Convolutional neural nets for single channel speech enhancement☆144Dec 15, 2020Updated 5 years ago
- ☆15Jan 3, 2018Updated 8 years ago
- Speech Enhancement using Bayesian WaveNet☆96Apr 1, 2018Updated 8 years ago
- Multi-hop Question Generation with Graph Convolutional Network☆30Nov 2, 2022Updated 3 years ago
- ☆24Oct 12, 2018Updated 7 years ago
- Repository for reproducing the results of the journal article "Self-supervised learning for Speech Emotion Recognition"☆10Mar 15, 2023Updated 3 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Dec 16, 2022Updated 3 years ago
- Pitch shifter using WSOLA and resampling, implemented in Python 3☆40Jul 19, 2017Updated 8 years ago
- Towards Few-Shot Fact-Checking via Perplexity☆13Jun 11, 2021Updated 5 years ago
- Speech recognition based on MFCC and GMM (Gaussian mixture models)☆254Mar 13, 2019Updated 7 years ago
- Spoken Language Identification from Short Utterances☆13Jul 6, 2022Updated 3 years ago
- Score Normalization for NIST 2019 Speaker Recognition Evaluation☆10Nov 8, 2019Updated 6 years ago
- ☆13Jun 21, 2021Updated 5 years ago
- A toolkit for researchers in multimodal sound separation.☆16Oct 20, 2023Updated 2 years ago
- A Pytorch implementation of 'AUTOMATIC SPEECH EMOTION RECOGNITION USING RECURRENT NEURAL NETWORKS WITH LOCAL ATTENTION'☆41Aug 1, 2018Updated 7 years ago
- App client for the Elasticsearch Chinese community☆13May 5, 2016Updated 10 years ago
- DeepRec Extension is an easy-to-use, stable and efficient large-scale distributed training system based on DeepRec.☆13May 17, 2024Updated 2 years ago
- Scripts to prepare OXFORD VGG Face dataset☆12Mar 29, 2016Updated 10 years ago
- webrtc audio processing☆424May 10, 2020Updated 6 years ago
- CAiRE in DialDoc21: Data Augmentation for Information-Seeking Dialogue System☆11May 24, 2022Updated 4 years ago
- Generation tool for offset-resistant audio adversarial examples against Deepspeech☆10Oct 5, 2020Updated 5 years ago
- A collection of trending speech enhancement papers☆11Dec 4, 2020Updated 5 years ago
- ☆12Jun 11, 2020Updated 6 years ago
- Speaker embedding (verification and recognition) using PyTorch☆369Jul 24, 2020Updated 5 years ago
- Speech enhancement system for the CHiME-5 dinner party scenario☆111Feb 6, 2025Updated last year
- SoundPy (alpha stage) is a research-based python package for speech and sound. Applications include deep-learning, filtering, speech-enha…☆78Jan 19, 2025Updated last year
- Use OpenAI Whisper to transcribe your voice into written text completely locally in one command☆11Dec 7, 2023Updated 2 years ago
- ☆11Nov 11, 2021Updated 4 years ago
- Python bindings of WebRTC Audio Processing☆216May 7, 2025Updated last year
- ☆11May 26, 2020Updated 6 years ago
- Computer code and dataset for "Universal Deep Beamformer for Robust Ultrasound Imaging"☆18Jan 15, 2019Updated 7 years ago
- Speech Enhancement based on DNN (Spectral-Mapping, TF-Masking), DNN-NMF, NMF☆189Mar 29, 2019Updated 7 years ago
- A Raspberry Pi intercom - Gofore hackathon project☆11Aug 18, 2020Updated 5 years ago
- Code and dataset for the paper [Modeling Attention and Memory for Auditory Selection in a Cocktail Party Environment. AAAI 2018]☆57Apr 12, 2018Updated 8 years ago
- Calculate MFCC/Fbank feature for wav files☆15Nov 21, 2017Updated 8 years ago
- A PyTorch implementation of x-vector embedding☆79Mar 28, 2020Updated 6 years ago