itaa/soja-box

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/itaa/soja-box)

itaa / soja-box

A little useful toolbox for python.

☆77

Alternatives and similar repositories for soja-box

Users that are interested in soja-box are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

WeldonWangwang / py-webrtcns
View on GitHub
Python interface to the WebRTC Noise Suppression
☆18Dec 16, 2021Updated 4 years ago
YorLife / webRTC-
View on GitHub
利用webRTC对语音进行处理，实现VAD和降噪处理
☆49Nov 13, 2018Updated 7 years ago
zhr1201 / CNN-for-single-channel-speech-enhancement
View on GitHub
Convolutional neural nets for single channel speech enhancement
☆144Dec 15, 2020Updated 5 years ago
orctom / rnnoise-java
View on GitHub
☆15Jan 3, 2018Updated 8 years ago
yongxuUSTC / sednn
View on GitHub
deep learning based speech enhancement using keras or pytorch, make it easy to use
☆339Feb 26, 2020Updated 6 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
auspicious3000 / WaveNet-Enhancement
View on GitHub
Speech Enhancement using Bayesian WaveNet
☆96Apr 1, 2018Updated 8 years ago
seungheondoh / msu-benchmark
View on GitHub
music semantic understanding evaluation benchmark
☆24Aug 12, 2023Updated 2 years ago
justinsalamon / UrbanSound8K-JAMS
View on GitHub
JAMS annotation files for the original and augmented UrbanSound8K dataset
☆35Jan 31, 2018Updated 8 years ago
droneboost / airkiss_weixin
View on GitHub
☆11May 11, 2017Updated 9 years ago
Jeongseungwoo / Singing-Voice-Separation
View on GitHub
☆24Oct 12, 2018Updated 7 years ago
zhr1201 / Multi-channel-speech-extraction-using-DNN
View on GitHub
A tensorflow implementation of my paper Combining beamforming and deep neural networks for multi-channel speech extraction
☆69Dec 15, 2020Updated 5 years ago
wblgers / py_speech_seg
View on GitHub
A toolkit to implement segmentation on speech based on BIC and nerual network, such as BiLSTM
☆123Aug 7, 2019Updated 6 years ago
HLTCHKUST / MulQG
View on GitHub
Multi-hop Question Generation with Graph Convolutional Network
☆30Nov 2, 2022Updated 3 years ago
llxlr / Speech-Recognition-With-Python
View on GitHub
Speech Recognition With Python | python语音识别
☆21Jul 22, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Neclow / SERAB
View on GitHub
SERAB: a multi-lingual benchmark for speech emotion recognition
☆28Dec 16, 2022Updated 3 years ago
wanleg / snowboyPi
View on GitHub
snowboy setup on raspberry pi
☆16Feb 21, 2018Updated 8 years ago
k2kobayashi / Shifter
View on GitHub
Pitch shifter using WSOLA and resampling implemented by Python3
☆40Jul 19, 2017Updated 9 years ago
HLTCHKUST / Perplexity-FactChecking
View on GitHub
Towards Few-Shot Fact-Checking via Perplexity
☆13Jun 11, 2021Updated 5 years ago
noiseux1523 / NIST-SRE-2019
View on GitHub
Score Normalization for NIST 2019 Speaker Recognition Evaluation
☆10Nov 8, 2019Updated 6 years ago
gogyzzz / localatt_emorecog
View on GitHub
A Pytorch implementation of 'AUTOMATIC SPEECH EMOTION RECOGNITION USING RECURRENT NEURAL NETWORKS WITH LOCAL ATTENTION'
☆41Aug 1, 2018Updated 7 years ago
crouchred / speaker-recognition-py3
View on GitHub
Base on MFCC and GMM(基于MFCC和高斯混合模型的语音识别)
☆254Mar 13, 2019Updated 7 years ago
PiSchool / spoken-language-id
View on GitHub
Spoken Language Identification from Short Utterances
☆13Jul 6, 2022Updated 4 years ago
JusperLee / Look2hear
View on GitHub
A toolkit for researchers in the multimodal sound separation.
☆16Oct 20, 2023Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
DeepRec-AI / extension
View on GitHub
DeepRec Extension is an easy-to-use, stable and efficient large-scale distributed training system based on DeepRec.
☆13May 17, 2024Updated 2 years ago
bpucla / ibebm
View on GitHub
☆13Jun 21, 2021Updated 5 years ago
marcoromanelli-github / ReliabilityDiagrams
View on GitHub
Create reliability diagrams to quantify ML calibration.
☆10Feb 1, 2022Updated 4 years ago
chenxi1103 / Face_Recognition_Project
View on GitHub
Gender/Race/Emotion classifications based on facial multi-attribute detection were realized through data pre-processing, face detection a…
☆12Dec 31, 2018Updated 7 years ago
vuvko / prepare-oxford-faces
View on GitHub
Scripts to prepare OXFORD VGG Face dataset
☆12Mar 29, 2016Updated 10 years ago
Fraunhofer-AISEC / towards-resistant-audio-adversarial-examples
View on GitHub
Generation tool for offset-resistant audio adversarial examples against Deepspeech
☆10Oct 5, 2020Updated 5 years ago
lix321 / leetcode
View on GitHub
☆12Jun 11, 2020Updated 6 years ago
sorenchiron / Awesome-Speech-Enhancement
View on GitHub
A collection of trending speech enhancement papers
☆11Dec 4, 2020Updated 5 years ago
keikeiqi / MGTTA
View on GitHub
AAAI2025
☆13Apr 18, 2025Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
qqueing / DeepSpeaker-pytorch
View on GitHub
Speaker embedding(verification and recognition) using Pytorch
☆369Jul 24, 2020Updated 6 years ago
IMLHF / WFb_SE
View on GitHub
(tensorflow) Wiener Filter based Speech Enhancement（LSTM/BLSTM, GRU/BGRU, Transformer）
☆15Dec 3, 2019Updated 6 years ago
fgnt / pb_chime5
View on GitHub
Speech enhancement system for the CHiME-5 dinner party scenario
☆111Feb 6, 2025Updated last year
bagustris / SER_ICSigSys2019
View on GitHub
Repository of code for Speech emotion recognition using voiced speech and attention model, submitted to ICSigSys 2019
☆13Jan 6, 2020Updated 6 years ago
a-n-rose / Python-Sound-Tool
View on GitHub
SoundPy (alpha stage) is a research-based python package for speech and sound. Applications include deep-learning, filtering, speech-enha…
☆79Jan 19, 2025Updated last year
HLTCHKUST / cqr4cqa
View on GitHub
☆13Sep 6, 2022Updated 3 years ago
tmlr-group / ZS-NTTA
View on GitHub
[ICLR 2025] "Noisy Test-Time Adaptation in Vision-Language Models"
☆13Feb 22, 2025Updated last year