SoundPy (alpha stage) is a research-based python package for speech and sound. Applications include deep-learning, filtering, speech-enhancement, audio augmentation, feature extraction and visualization, dataset and audio file conversion, and beyond.
☆78Jan 19, 2025Updated last year
Alternatives and similar repositories for Python-Sound-Tool
Users that are interested in Python-Sound-Tool are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A selective noise filter architecture driven by a CNN and Wiener filter☆17Nov 21, 2019Updated 6 years ago
- A smartphone applications with Convolutional Neural Network Voice Activity Detector, Adaptive Noise Reduction and Dynamic Audio Range Com…☆21Apr 30, 2019Updated 7 years ago
- A Convolutional Neural Network based Voice Activity Detector for Smartphones☆70Apr 30, 2019Updated 7 years ago
- An Attention-based Neural Network Approach for Single Channel Speech Enhancement☆25Dec 1, 2019Updated 6 years ago
- ICASSP2022 TTS&VC Summary☆14Jun 9, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago
- Python library for handling audio datasets.☆139Jul 6, 2023Updated 2 years ago
- ☆38Jul 20, 2020Updated 5 years ago
- A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.☆69Sep 9, 2019Updated 6 years ago
- Speech denoiser model using Keras☆20Jan 23, 2019Updated 7 years ago
- A library for speech data augmentation in time-domain☆687Aug 30, 2021Updated 4 years ago
- An open-source speech separation and enhancement library☆214May 13, 2020Updated 6 years ago
- 网络出处:Interactive Speech and Noise Modeling for Speech Enhancement☆28Jan 10, 2022Updated 4 years ago
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".☆12Oct 25, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Convert images to audio for display in a spectrogram☆12Apr 17, 2018Updated 8 years ago
- ☆23Apr 25, 2022Updated 4 years ago
- Improved speech enhancement with the Wave-U-Net, a deep convolutional neural network architecture for audio source separation, implemente…☆224Mar 24, 2023Updated 3 years ago
- A Visualizer for prosodically annotated speech corpora☆12Oct 27, 2021Updated 4 years ago
- Generative Adversarial Networks for different impaired speech conversions☆39Jul 6, 2023Updated 2 years ago
- A Python Library for Fundamental Frequency Estimation in Music Recordings☆57Jan 16, 2026Updated 4 months ago
- A Python toolbox for speech features extraction☆165Feb 8, 2023Updated 3 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- Code and audio files associated with the paper "Speech Enhancement with Variance Constrained Autoencoders" presented at Interspeech 2019☆15Oct 10, 2019Updated 6 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A deep neural network for finding text-independent speaker embedding written in tensorflow and tensorpack☆10Feb 19, 2018Updated 8 years ago
- Unbounded cache model for online language modeling with open vocabulary☆11Feb 15, 2019Updated 7 years ago
- SelfRemaster: SSL Speech Restoration☆94Jan 5, 2024Updated 2 years ago
- A musical ode to musical code☆17Jan 24, 2022Updated 4 years ago
- ☆24Jul 22, 2019Updated 6 years ago
- Matlab tools for pathological voice analysis☆14May 12, 2023Updated 3 years ago
- compare three CTC decoder, that is greedy decoder, beam decoder and prefix beam decoder☆20Jul 10, 2018Updated 7 years ago
- ☆18Jul 22, 2024Updated last year
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR☆1,050Jul 5, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A Chinese version of A Neural Parametric Singing Synthesizer☆13Feb 12, 2022Updated 4 years ago
- Main Melody Extraction with Source-Filter NMF and CRNN☆25Apr 8, 2019Updated 7 years ago
- ☆32Apr 1, 2023Updated 3 years ago
- keras project for audio deep learning☆40Apr 10, 2018Updated 8 years ago
- speech enhancement algorithms for microphone arrays☆15May 12, 2020Updated 6 years ago
- ☆16Dec 23, 2021Updated 4 years ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆19Jul 16, 2024Updated last year