SoundPy (alpha stage) is a research-based python package for speech and sound. Applications include deep-learning, filtering, speech-enhancement, audio augmentation, feature extraction and visualization, dataset and audio file conversion, and beyond.
☆77Jan 19, 2025Updated last year
Alternatives and similar repositories for Python-Sound-Tool
Users that are interested in Python-Sound-Tool are comparing it to the libraries listed below
Sorting:
- A selective noise filter architecture driven by a CNN and Wiener filter☆18Nov 21, 2019Updated 6 years ago
- ICASSP2022 TTS&VC Summary☆14Jun 9, 2022Updated 3 years ago
- An Attention-based Neural Network Approach for Single Channel Speech Enhancement☆25Dec 1, 2019Updated 6 years ago
- Python library for handling audio datasets.☆138Jul 6, 2023Updated 2 years ago
- A Convolutional Neural Network based Voice Activity Detector for Smartphones☆70Apr 30, 2019Updated 6 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago
- A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.☆67Sep 9, 2019Updated 6 years ago
- An open-source speech separation and enhancement library☆214May 13, 2020Updated 5 years ago
- ☆23Apr 25, 2022Updated 3 years ago
- Finally, some decent sample sentences☆23Dec 3, 2023Updated 2 years ago
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".☆12Oct 25, 2021Updated 4 years ago
- Unbounded cache model for online language modeling with open vocabulary☆11Feb 15, 2019Updated 7 years ago
- A Visualizer for prosodically annotated speech corpora☆12Oct 27, 2021Updated 4 years ago
- Code for "Error-driven Fixed-Budget ASR Personalization for Accented Speakers" in ICASSP 2021☆11Jun 13, 2021Updated 4 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- ☆24Jul 22, 2019Updated 6 years ago
- Generative Adversarial Networks for different impaired speech conversions☆38Jul 6, 2023Updated 2 years ago
- ☆38Jul 20, 2020Updated 5 years ago
- Automated, end-to-end wakeword model maker using the Precise Wakeword Engine☆27Feb 23, 2022Updated 4 years ago
- Image-source method for room acoustics☆14Feb 5, 2020Updated 6 years ago
- Python library for audio augmentation☆85Jul 6, 2023Updated 2 years ago
- A library for speech data augmentation in time-domain☆682Aug 30, 2021Updated 4 years ago
- A Python toolbox for speech features extraction☆165Feb 8, 2023Updated 3 years ago
- Convert images to audio for display in a spectrogram☆12Apr 17, 2018Updated 7 years ago
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysox☆13Feb 20, 2018Updated 8 years ago
- speech enhancement algorithms for microphone arrays☆15May 12, 2020Updated 5 years ago
- A deep neural network for finding text-independent speaker embedding written in tensorflow and tensorpack☆10Feb 19, 2018Updated 8 years ago
- ☆32Apr 1, 2023Updated 2 years ago
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Apr 11, 2022Updated 3 years ago
- SelfRemaster: SSL Speech Restoration☆94Jan 5, 2024Updated 2 years ago
- Torch-based tool for quantizing high-dimensional vectors using additive codebooks☆54May 25, 2022Updated 3 years ago
- Components loss for neural networks in mask-based speech enhancement☆33Nov 20, 2020Updated 5 years ago
- ☆16Feb 9, 2024Updated 2 years ago
- Matlab tools for pathological voice analysis☆13May 12, 2023Updated 2 years ago
- ☆12Nov 5, 2019Updated 6 years ago
- Improved speech enhancement with the Wave-U-Net, a deep convolutional neural network architecture for audio source separation, implemente…☆223Mar 24, 2023Updated 2 years ago
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR☆1,036Jul 5, 2023Updated 2 years ago
- Voice conversion (VC) investigation using three variants of VAE☆59Oct 28, 2019Updated 6 years ago
- (tensorflow) Wiener Filter based Speech Enhancement(LSTM/BLSTM, GRU/BGRU, Transformer)☆15Dec 3, 2019Updated 6 years ago