Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
☆6,235Aug 4, 2025Updated 7 months ago
Alternatives and similar repositories for pyAudioAnalysis
Users that are interested in pyAudioAnalysis are comparing it to the libraries listed below
Sorting:
- Python AUdio Recording and Analysis (paura)☆226Jul 6, 2023Updated 2 years ago
- Python library for audio and music analysis☆8,276Updated this week
- Manipulate audio with a simple and easy high level interface☆9,744Jul 26, 2025Updated 7 months ago
- This library provides common speech features for ASR including MFCCs and filterbank energies.☆2,422Oct 20, 2021Updated 4 years ago
- An audio/acoustic activity detection and audio segmentation tool☆841Dec 11, 2024Updated last year
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆2,844Updated this week
- Pytorch implementation of deep audio embedding calculation☆106Jul 23, 2023Updated 2 years ago
- Audio fingerprinting and recognition in Python☆6,733Apr 22, 2024Updated last year
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker…☆9,356Mar 12, 2026Updated last week
- a library for audio and music analysis☆3,668Nov 20, 2025Updated 4 months ago
- C++ library for audio and music analysis, description and synthesis, including Python bindings☆3,456Feb 9, 2026Updated last month
- Python interface to the WebRTC Voice Activity Detector☆2,451Jul 4, 2024Updated last year
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.☆1,853Jul 22, 2025Updated 8 months ago
- kaldi-asr/kaldi is the official location of the Kaldi project.☆15,348Sep 22, 2025Updated 6 months ago
- Python audio and music signal processing library☆1,601Aug 25, 2024Updated last year
- kapre: Keras Audio Preprocessors☆946Oct 26, 2025Updated 4 months ago
- Curated list of python software and packages related to scientific research in audio☆1,684Jan 19, 2026Updated 2 months ago
- A PyTorch-based Speech Toolkit☆11,330Mar 1, 2026Updated 3 weeks ago
- CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender …☆872Mar 12, 2026Updated last week
- This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Su…☆1,589Sep 25, 2024Updated last year
- Speech recognition module for Python, supporting several engines and APIs, online and offline.☆8,961Mar 11, 2026Updated last week
- ESC-50: Dataset for Environmental Sound Classification☆1,765Mar 20, 2024Updated 2 years ago
- Audio features extraction☆248Jun 21, 2021Updated 4 years ago
- DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Ras…☆26,737Jun 19, 2025Updated 9 months ago
- End-to-End Speech Processing Toolkit☆9,780Updated this week
- Python examples for the course "Multimodal Information Processing & Analysis" of the MSc in Data Science in NCSR Demokritos☆98Jul 6, 2023Updated 2 years ago
- A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.☆2,241Dec 27, 2025Updated 2 months ago
- Instructional notebooks on music information retrieval.☆1,267Feb 11, 2026Updated last month
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition☆498Jul 1, 2021Updated 4 years ago
- Praat in Python, the Pythonic way☆1,242Mar 2, 2026Updated 2 weeks ago
- ☆1,680Jul 25, 2024Updated last year
- pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,…☆2,396Mar 14, 2022Updated 4 years ago
- SincNet is a neural architecture for efficiently processing raw audio samples.☆1,235Apr 28, 2021Updated 4 years ago
- Noise reduction in python using spectral gating (speech, bioacoustics, audio, time-domain signals)☆1,829Aug 19, 2025Updated 7 months ago
- The PyTorch-based audio source separation toolkit for researchers☆2,546Oct 6, 2025Updated 5 months ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆32,190Sep 30, 2025Updated 5 months ago
- Audio feature extraction and classification☆228Jul 6, 2023Updated 2 years ago
- Magenta: Music and Art Generation with Machine Intelligence☆19,776Jan 6, 2026Updated 2 months ago
- Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech E…☆1,881Mar 14, 2023Updated 3 years ago