a-n-rose/Python-Sound-Tool

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/a-n-rose/Python-Sound-Tool)

a-n-rose / Python-Sound-Tool

SoundPy (alpha stage) is a research-based python package for speech and sound. Applications include deep-learning, filtering, speech-enhancement, audio augmentation, feature extraction and visualization, dataset and audio file conversion, and beyond.

☆79

Alternatives and similar repositories for Python-Sound-Tool

Users that are interested in Python-Sound-Tool are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

pgys / NoIze
View on GitHub
A selective noise filter architecture driven by a CNN and Wiener filter
☆17Nov 21, 2019Updated 6 years ago
SIP-Lab / Integrated-Hearing-Aid-App
View on GitHub
A smartphone applications with Convolutional Neural Network Voice Activity Detector, Adaptive Noise Reduction and Dynamic Audio Range Com…
☆22Apr 30, 2019Updated 7 years ago
chanil1218 / Attention-SE.pytorch
View on GitHub
An Attention-based Neural Network Approach for Single Channel Speech Enhancement
☆25Dec 1, 2019Updated 6 years ago
SIP-Lab / CNN-VAD
View on GitHub
A Convolutional Neural Network based Voice Activity Detector for Smartphones
☆70Apr 30, 2019Updated 7 years ago
lmxue / ICASSP2022_TTS_VC_Summary
View on GitHub
ICASSP2022 TTS&VC Summary
☆13Jun 9, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ynop / audiomate
View on GitHub
Python library for handling audio datasets.
☆139Jul 6, 2023Updated 3 years ago
rishikksh20 / NU-Wave2-pytorch
View on GitHub
NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]
☆25Jul 5, 2022Updated 4 years ago
dtake1336 / ERNN-for-speech-enhancement
View on GitHub
☆38Jul 20, 2020Updated 6 years ago
daanzu / wenet_stt_python
View on GitHub
☆33Nov 27, 2021Updated 4 years ago
cyrta / awesome-speech-enhancement
View on GitHub
A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.
☆69Sep 9, 2019Updated 6 years ago
bill9800 / Speech-denoise-Autoencoder
View on GitHub
Speech denoiser model using Keras
☆20Jan 23, 2019Updated 7 years ago
rrkarim / unbounded-cache-lm
View on GitHub
Unbounded cache model for online language modeling with open vocabulary
☆11Feb 15, 2019Updated 7 years ago
facebookresearch / WavAugment
View on GitHub
A library for speech data augmentation in time-domain
☆689Aug 30, 2021Updated 4 years ago
speechLabBcCuny / onssen
View on GitHub
An open-source speech separation and enhancement library
☆214May 13, 2020Updated 6 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Gitxiaoke / SNnet
View on GitHub
网络出处：Interactive Speech and Noise Modeling for Speech Enhancement
☆28Jan 10, 2022Updated 4 years ago
shareef12 / img2wav
View on GitHub
Convert images to audio for display in a spectrogram
☆13Apr 17, 2018Updated 8 years ago
ifnspaml / Enhancement-Coded-Speech
View on GitHub
☆24Apr 25, 2022Updated 4 years ago
shincling / discreteSeparation
View on GitHub
The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".
☆12Oct 25, 2021Updated 4 years ago
craigmacartney / Wave-U-Net-For-Speech-Enhancement
View on GitHub
Improved speech enhancement with the Wave-U-Net, a deep convolutional neural network architecture for audio source separation, implemente…
☆224Mar 24, 2023Updated 3 years ago
Maitreyapatel / speech-conversion-between-different-modalities
View on GitHub
Generative Adversarial Networks for different impaired speech conversions
☆39Jul 6, 2023Updated 3 years ago
groupmm / libf0
View on GitHub
A Python Library for Fundamental Frequency Estimation in Music Recordings
☆55Jun 5, 2026Updated last month
alpoktem / Prosograph
View on GitHub
A Visualizer for prosodically annotated speech corpora
☆12Oct 27, 2021Updated 4 years ago
danielbraithwt / Speech-Enhancement-with-Variance-Constrained-Autoencoders
View on GitHub
Code and audio files associated with the paper "Speech Enhancement with Variance Constrained Autoencoders" presented at Interspeech 2019
☆15Oct 10, 2019Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
revsic / torch-retriever-vc
View on GitHub
PyTorch implementation of Retriever: Learning Content-Style Representation
☆12Jan 27, 2023Updated 3 years ago
bootphon / shennong
View on GitHub
A Python toolbox for speech features extraction
☆166Feb 8, 2023Updated 3 years ago
Akella17 / speaker-embedding
View on GitHub
A deep neural network for finding text-independent speaker embedding written in tensorflow and tensorpack
☆10Feb 19, 2018Updated 8 years ago
Takaaki-Saeki / ssl_speech_restoration
View on GitHub
SelfRemaster: SSL Speech Restoration
☆94Jan 5, 2024Updated 2 years ago
psc-g / musicode
View on GitHub
A musical ode to musical code
☆17Jan 24, 2022Updated 4 years ago
ihp-lab / Speaker-Invariant-Domain-Adversarial-Neural-Networks
View on GitHub
☆11Sep 29, 2020Updated 5 years ago
DaoZhang0123 / compareCTCDecoder
View on GitHub
compare three CTC decoder, that is greedy decoder, beam decoder and prefix beam decoder
☆20Jul 10, 2018Updated 8 years ago
Mak-Sim / Troparion
View on GitHub
Matlab tools for pathological voice analysis
☆14May 12, 2023Updated 3 years ago
Sytronik / denoising-wavenet-pytorch
View on GitHub
☆24Jul 22, 2019Updated 7 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
rithiksachdev / PostASR-Correction-SLT2024
View on GitHub
☆18Jul 22, 2024Updated 2 years ago
aliutkus / speechmetrics
View on GitHub
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
☆1,050Jul 5, 2023Updated 3 years ago
seaniezhao / cnnpss
View on GitHub
A Chinese version of A Neural Parametric Singing Synthesizer
☆13Feb 12, 2022Updated 4 years ago
DCASE2023-Task7-Foley-Sound-Synthesis / dcase2023_task7_baseline
View on GitHub
☆32Apr 1, 2023Updated 3 years ago
dogacbasaran / ismir2018_dominant_melody_estimation
View on GitHub
Main Melody Extraction with Source-Filter NMF and CRNN
☆25Apr 8, 2019Updated 7 years ago
ttslr / MonTTS
View on GitHub
☆16Dec 23, 2021Updated 4 years ago
Sreyan88 / LipGER
View on GitHub
Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition
☆19Jul 16, 2024Updated 2 years ago