tyiannak / pyAudioAnalysisLinks
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
☆6,055Updated 2 months ago
Alternatives and similar repositories for pyAudioAnalysis
Users that are interested in pyAudioAnalysis are comparing it to the libraries listed below
Sorting:
- Python library for audio and music analysis☆7,637Updated last week
- This library provides common speech features for ASR including MFCCs and filterbank energies.☆2,401Updated 3 years ago
- An audio/acoustic activity detection and audio segmentation tool☆778Updated 5 months ago
- Audio fingerprinting and recognition in Python☆6,561Updated last year
- Python interface to the WebRTC Voice Activity Detector☆2,246Updated 10 months ago
- DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Ras…☆26,395Updated 8 months ago
- kapre: Keras Audio Preprocessors☆932Updated last year
- Instructional notebooks on music information retrieval.☆1,244Updated last year
- aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)☆2,661Updated 11 months ago
- kaldi-asr/kaldi is the official location of the Kaldi project.☆14,878Updated last month
- A collection of links and notes on forced alignment tools☆910Updated 3 years ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆2,673Updated this week
- A python package to analyze and compare voices with deep learning☆2,968Updated last year
- Python AUdio Recording and Analysis (paura)☆223Updated last year
- CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender …☆806Updated 4 months ago
- 🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).☆1,928Updated 11 months ago
- Command line utility for forced alignment using Kaldi☆1,486Updated this week
- A Python wrapper for Kaldi☆1,017Updated 4 months ago
- WaveNet vocoder☆2,356Updated last year
- Deep Speaker: an End-to-End Neural Speaker Embedding System.☆930Updated last year
- SincNet is a neural architecture for efficiently processing raw audio samples.☆1,181Updated 4 years ago
- Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synth…☆3,040Updated last year
- Facebook AI Research's Automatic Speech Recognition Toolkit☆6,427Updated 6 months ago
- Curated list of python software and packages related to scientific research in audio☆1,613Updated last year
- Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow☆3,985Updated 3 years ago
- Noise reduction in python using spectral gating (speech, bioacoustics, audio, time-domain signals)☆1,671Updated 5 months ago
- pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,…☆2,387Updated 3 years ago
- Manipulate audio with a simple and easy high level interface☆9,390Updated 10 months ago
- The Audio Set Ontology aims to provide a comprehensive set of categories to describe sound events.☆671Updated 7 years ago
- This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Su…☆1,574Updated 8 months ago