Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
☆6,244Aug 4, 2025Updated 10 months ago
Alternatives and similar repositories for pyAudioAnalysis
Users that are interested in pyAudioAnalysis are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Python AUdio Recording and Analysis (paura)☆226Jul 6, 2023Updated 2 years ago
- Python library for audio and music analysis☆8,478Updated this week
- Manipulate audio with a simple and easy high level interface☆9,776Mar 19, 2026Updated 3 months ago
- This library provides common speech features for ASR including MFCCs and filterbank energies.☆2,423Oct 20, 2021Updated 4 years ago
- An audio/acoustic activity detection and audio segmentation tool☆854May 14, 2026Updated last month
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆2,889Updated this week
- Pytorch implementation of deep audio embedding calculation☆106Jul 23, 2023Updated 2 years ago
- Audio fingerprinting and recognition in Python☆6,770Apr 22, 2024Updated 2 years ago
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker…☆10,202Updated this week
- a library for audio and music analysis☆3,712Apr 10, 2026Updated 2 months ago
- C++ library for audio and music analysis, description and synthesis, including Python bindings☆3,603Jun 15, 2026Updated 2 weeks ago
- Python interface to the WebRTC Voice Activity Detector☆2,490Jul 4, 2024Updated last year
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.☆1,878Jun 1, 2026Updated last month
- kaldi-asr/kaldi is the official location of the Kaldi project.☆15,417Sep 22, 2025Updated 9 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Python audio and music signal processing library☆1,664Mar 20, 2026Updated 3 months ago
- kapre: Keras Audio Preprocessors☆945May 17, 2026Updated last month
- Curated list of python software and packages related to scientific research in audio☆1,694Jun 11, 2026Updated 2 weeks ago
- A PyTorch-based Speech Toolkit☆11,659Jun 15, 2026Updated 2 weeks ago
- CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender …☆898Mar 12, 2026Updated 3 months ago
- This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Su…☆1,588Sep 25, 2024Updated last year
- Speech recognition module for Python, supporting several engines and APIs, online and offline.☆8,974Jun 16, 2026Updated 2 weeks ago
- Audio features extraction☆248Jun 21, 2021Updated 5 years ago
- ESC-50: Dataset for Environmental Sound Classification☆1,833Mar 20, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Ras…☆26,759Jun 19, 2025Updated last year
- End-to-End Speech Processing Toolkit☆9,868Jun 24, 2026Updated last week
- Python examples for the course "Multimodal Information Processing & Analysis" of the MSc in Data Science in NCSR Demokritos☆98Jul 6, 2023Updated 2 years ago
- A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.☆2,292Apr 13, 2026Updated 2 months ago
- Instructional notebooks on music information retrieval.☆1,277May 19, 2026Updated last month
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition☆500Jul 1, 2021Updated 5 years ago
- Praat in Python, the Pythonic way☆1,267Jun 23, 2026Updated last week
- pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,…☆2,399Mar 14, 2022Updated 4 years ago
- ☆1,762Jul 25, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- SincNet is a neural architecture for efficiently processing raw audio samples.☆1,242Apr 28, 2021Updated 5 years ago
- Noise reduction in python using spectral gating (speech, bioacoustics, audio, time-domain signals)☆1,850Aug 19, 2025Updated 10 months ago
- The PyTorch-based audio source separation toolkit for researchers☆2,574May 13, 2026Updated last month
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆32,233Sep 30, 2025Updated 9 months ago
- Audio feature extraction and classification☆226Jul 6, 2023Updated 2 years ago
- Magenta: Music and Art Generation with Machine Intelligence☆19,798Jan 6, 2026Updated 5 months ago
- Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech E…☆1,899Mar 14, 2023Updated 3 years ago