A simple audio feature extraction library
☆81Jul 3, 2019Updated 6 years ago
Alternatives and similar repositories for sonopy
Users that are interested in sonopy are comparing it to the libraries listed below
Sorting:
- This repository is for wake-word detection in speech using recurrent neural networks☆17Feb 25, 2019Updated 7 years ago
- Desktop GUI applications to show audio waveform and spectrogram which is visual representation of sound using the amplitude of the freque…☆12Jul 21, 2023Updated 2 years ago
- Waveflow: signal processing with tensorflow.☆13May 21, 2018Updated 7 years ago
- Create speaker voiceprints from a few seconds of audio. And, identify individuals in real-time streaming or recorded conversations.☆14Feb 4, 2019Updated 7 years ago
- SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/☆885Dec 15, 2024Updated last year
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆111Mar 19, 2024Updated last year
- 🎤 quick library to extract pause lengths from audio files.☆32Jun 5, 2019Updated 6 years ago
- Automatic Arabic diacritics restoration tool.☆18Aug 12, 2021Updated 4 years ago
- This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervi…☆28Oct 10, 2019Updated 6 years ago
- TensorFlow code for the paper Convolutional RNN: an Enhanced Model for Extracting Features from Sequential Data (https://arxiv.org/abs/16…☆32May 29, 2017Updated 8 years ago
- Self-contained Python package for OpenFst☆51Feb 1, 2023Updated 3 years ago
- Audio Keyword Search☆12May 5, 2019Updated 6 years ago
- A rigid, lightweight, dead-simple intent parser☆11May 30, 2022Updated 3 years ago
- Real-time speech enhancement based on spectral subtraction☆16Feb 18, 2018Updated 8 years ago
- The 1st place solution for AutoSpeech 2019.☆17Jun 9, 2020Updated 5 years ago
- Me building a simple synthesizer and sequencer to learn about Web Audio. Check out the wiki.☆16Oct 11, 2021Updated 4 years ago
- A spectrograph display in your terminal☆17Oct 30, 2018Updated 7 years ago
- Speech Enhancement using Bayesian WaveNet☆98Apr 1, 2018Updated 7 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.☆36Oct 4, 2019Updated 6 years ago
- Identify sounds in short audio clips☆156Sep 17, 2025Updated 5 months ago
- Medium Articles Notebooks and Media Files☆16Apr 11, 2024Updated last year
- ☆20Jul 22, 2022Updated 3 years ago
- Universal Deep neural network based speech enhancement demo and tools, well pre-trained DNN model☆67Feb 23, 2023Updated 3 years ago
- Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"☆103Mar 18, 2019Updated 6 years ago
- speech enhancement GAN on waveform/log-power-spectrum data using Improved WGAN☆35Apr 16, 2018Updated 7 years ago
- Deep neural network based speech enhancement toolkit☆218Jun 14, 2019Updated 6 years ago
- Tools for speech processing, keyword spotting☆17Mar 11, 2020Updated 5 years ago
- ☆17Feb 1, 2021Updated 5 years ago
- Multi class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to id…☆69Jan 8, 2021Updated 5 years ago
- transformer for ASR-systerm (via tensorflow2.0)☆114May 7, 2019Updated 6 years ago
- Generate vector embeddings for music☆18Nov 7, 2017Updated 8 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- Neural Turing machine for source separation in Tensorflow☆18Aug 16, 2017Updated 8 years ago
- ☆20Nov 22, 2020Updated 5 years ago
- Official repository for RawNet, RawNet2, and RawNet3☆397Mar 21, 2024Updated last year
- Discriminative Neural Clustering for Speaker Diarisation☆79Apr 8, 2022Updated 3 years ago
- Code for prefix beam search tutorial by @labodk☆187Dec 9, 2020Updated 5 years ago
- A Chainer implementation of ClariNet.☆45Nov 19, 2018Updated 7 years ago
- A toolkit to implement segmentation on speech based on BIC and nerual network, such as BiLSTM☆123Aug 7, 2019Updated 6 years ago