raymondxyy/pyaudlib

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/raymondxyy/pyaudlib)

raymondxyy / pyaudlib

A speech signal processing library in Python with emphasis on deep learning.

☆31

Alternatives and similar repositories for pyaudlib

Users that are interested in pyaudlib are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

etzinis / biased_separation
View on GitHub
Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation
☆14Nov 16, 2020Updated 5 years ago
frankenliu / LOAE
View on GitHub
☆10Sep 25, 2024Updated last year
patrickltobing / shallow-wavenet
View on GitHub
☆18Feb 9, 2020Updated 6 years ago
bootphon / learnable-strf
View on GitHub
Learnable STRF, from Riad et al. 2021 JASA
☆13Aug 21, 2021Updated 4 years ago
Akshat4112 / voicenet
View on GitHub
Comprehensive Python library for speech and voice.
☆32Dec 8, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Yolanda-Gao / VoiceGANmodel
View on GitHub
☆19Feb 28, 2018Updated 8 years ago
tvuong123 / ModulationDomainLoss
View on GitHub
Official repo for "A MODULATION-DOMAIN LOSS FOR NEURAL-NETWORK-BASED REAL-TIME SPEECH ENHANCEMENT" to appear in ICASSP 2021
☆44Oct 14, 2021Updated 4 years ago
JinjiangLiu / ICCRN
View on GitHub
☆18Mar 10, 2023Updated 3 years ago
schufo / tisms
View on GitHub
This is the code of the ICASSP 2020 paper "Joint phoneme alignment and text-informed speech separation on highly corrupted speech"
☆16Apr 8, 2024Updated 2 years ago
hyx16 / SPMIArray
View on GitHub
Tsinghua University SPMI Lab array processing toolkit
☆18Nov 23, 2016Updated 9 years ago
asteroid-team / asteroid-filterbanks
View on GitHub
Asteroid's filterbanks
☆90Jan 12, 2025Updated last year
popcornell / MicRank
View on GitHub
MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.
☆22Apr 8, 2021Updated 5 years ago
ankitshah009 / WALNet-Weak_Label_Analysis
View on GitHub
Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.
☆32Sep 13, 2023Updated 2 years ago
vinusankars / ESOLA
View on GitHub
Epoch-synchronous overlap-add (ESOLA) for time-and pitch-scale modification of speech signals.
☆23Jul 24, 2020Updated 6 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
sadhusamik / fdlp_spectrogram
View on GitHub
☆14Nov 28, 2022Updated 3 years ago
shincling / discreteSeparation
View on GitHub
The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".
☆12Oct 25, 2021Updated 4 years ago
ZhongshuHou / LSA
View on GitHub
Ablation study of local spectral attention (LSA) for full-band speech enhancement (SE)
☆28Sep 16, 2023Updated 2 years ago
sp-uhh / mp-gtf
View on GitHub
Multi-Phase Gammatone Filterbank (MP-GTF) construction for Python
☆48Apr 30, 2020Updated 6 years ago
ifnspaml / Components-Loss
View on GitHub
Components loss for neural networks in mask-based speech enhancement
☆33Nov 20, 2020Updated 5 years ago
asteroid-team / Libri_VAD
View on GitHub
Script to generate VAD dataset used in Asteroid recipe
☆21Sep 30, 2021Updated 4 years ago
ws-choi / AMSS-Net
View on GitHub
A PyTorch implementation of the paper: "AMSS-Net: Audio Manipulation on User-Specified Sources with Textual Queries" (ACM Multimedia 2021…
☆21Jul 4, 2021Updated 5 years ago
BUTSpeechFIT / vae_dolphin
View on GitHub
☆10Jan 26, 2021Updated 5 years ago
raymondxyy / strfnet-IS2020
View on GitHub
Official repo for the STRFNet system appeared in INTERSPEECH2020
☆12Mar 6, 2021Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
fakufaku / 2020_interspeech_gmdp
View on GitHub
Generalized Minimal Distortion Principle for Blind Source Separation
☆22Sep 16, 2020Updated 5 years ago
Lukelluke / MCD-MEL-CEPSTRAL-DISTANCE-MCD-application
View on GitHub
Mel cepstral distortion (MCD) computations in python. Use Merlin toolkit to convert .wav files to .gcm files. Work in all form of .wav fi…
☆22Sep 4, 2020Updated 5 years ago
fgnt / padertorch
View on GitHub
A collection of common functionality to simplify the design, training and evaluation of machine learning models based on pytorch with an …
☆72Feb 26, 2026Updated 4 months ago
Sytronik / thesia
View on GitHub
Thesia is a Multi-track Spectrogram / Waveform viewer
☆23Jul 17, 2026Updated last week
danpovey / filtering
View on GitHub
Utilities for resampling and filtering audio data
☆47Jan 9, 2020Updated 6 years ago
yuzhou-git / deep-casa
View on GitHub
Tensorflow implementation of deep CASA
☆65Jun 6, 2021Updated 5 years ago
OSU-slatelab / mimic-enhance
View on GitHub
Speech enhancement using mimic loss
☆16Oct 25, 2019Updated 6 years ago
aframires / freesound-loop-annotator
View on GitHub
A web app for annotating Freesound loops, and the tools to analyse the dataset created.
☆20Jul 6, 2023Updated 3 years ago
FrancoisGrondin / BIRD
View on GitHub
Big Impulse Response Dataset
☆159Oct 19, 2022Updated 3 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
denfed / leaf-audio-pytorch
View on GitHub
Pytorch port of Google Research's LEAF Audio paper
☆91May 19, 2021Updated 5 years ago
dtake1336 / ERNN-for-speech-enhancement
View on GitHub
☆38Jul 20, 2020Updated 6 years ago
pquochuy / dcase2020-seld
View on GitHub
Source code of the DCASE 2020 SELD submission "Audio Event Detection and Localization with Multitask Regression Network"
☆17Jul 8, 2020Updated 6 years ago
jonashaag / audio-resampling-in-python
View on GitHub
Comparison of Python audio resampling implementations
☆54Jun 30, 2021Updated 5 years ago
sweetcocoa / crepe-pytorch
View on GitHub
Implementation of CREPE Pitch tracker with PyTorch
☆19Jan 28, 2020Updated 6 years ago
slp-rl / SC-PhASE
View on GitHub
This repo contains the official PyTorch implementation of "A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement" (…
☆28Aug 8, 2022Updated 3 years ago
ArjaanAuinger / pyaudiodsptools
View on GitHub
Numpy Audio DSP Tools
☆215Mar 18, 2025Updated last year