MycroftAI / sonopy
A simple audio feature extraction library
☆80Updated 5 years ago
Alternatives and similar repositories for sonopy:
Users that are interested in sonopy are comparing it to the libraries listed below
- This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…☆95Updated 4 years ago
- Python library for audio augmentation☆84Updated last year
- Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper☆142Updated last year
- Python library for handling audio datasets.☆137Updated last year
- Training neural audio classifiers with few data − https://arxiv.org/abs/1810.10274☆60Updated 6 years ago
- Python implementation of pre-processing for End-to-End speech recognition☆69Updated 7 years ago
- Discriminative Neural Clustering for Speaker Diarisation☆78Updated 3 years ago
- A collection of utilities for Detection and Classification of Acoustic Scenes and Events☆128Updated last month
- Evaluation toolbox for Sound Event Detection☆147Updated 10 months ago
- Benchmark popular audio i/o packages☆140Updated last year
- Utils and data sets for audio and PyTorch☆85Updated 3 years ago
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.☆136Updated 5 years ago
- Speaker diarization python system based on binary key speaker modelling☆61Updated 3 years ago
- ASR with PyTorch☆139Updated 6 years ago
- Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)☆73Updated 3 years ago
- Articulatory features estimation using Listen Attend and Spell architecture.☆32Updated 5 years ago
- ☆26Updated 7 years ago
- Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"☆102Updated 6 years ago
- Gammatone-based spectrograms, using gammatone filterbanks or Fourier transform weightings.☆222Updated last year
- Feature extractor for DL speech processing.☆65Updated 3 years ago
- A large, free audio sample database (10M words pronounced), a test bed for voice activity detection algorithms and for single-syllable wo…☆69Updated 7 years ago
- A Python toolbox for speech features extraction☆162Updated 2 years ago
- 1st place solution to the DCASE 2019 - Task 5 - Urban Sound Tagging☆30Updated 4 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks☆65Updated 4 years ago
- Visualization toolbox for Sound Event Detection☆119Updated last year
- Improved speech enhancement with the Wave-U-Net, a deep convolutional neural network architecture for audio source separation, implemente…☆219Updated 2 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…☆65Updated 4 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.☆65Updated 5 years ago
- ESC: Dataset for Environmental Sound Classification - paper replication data☆79Updated 7 years ago
- maracas is a library for corrupting audio files with additive and convolutive noise.☆72Updated 7 years ago