Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
☆6,240Aug 4, 2025Updated 9 months ago
Alternatives and similar repositories for pyAudioAnalysis
Users that are interested in pyAudioAnalysis are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Python AUdio Recording and Analysis (paura)☆226Jul 6, 2023Updated 2 years ago
- Python library for audio and music analysis☆8,397May 11, 2026Updated last week
- Manipulate audio with a simple and easy high level interface☆9,756Mar 19, 2026Updated 2 months ago
- This library provides common speech features for ASR including MFCCs and filterbank energies.☆2,423Oct 20, 2021Updated 4 years ago
- An audio/acoustic activity detection and audio segmentation tool☆848May 14, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆2,872May 14, 2026Updated last week
- Pytorch implementation of deep audio embedding calculation☆106Jul 23, 2023Updated 2 years ago
- Audio fingerprinting and recognition in Python☆6,753Apr 22, 2024Updated 2 years ago
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker…☆9,953Updated this week
- a library for audio and music analysis☆3,704Apr 10, 2026Updated last month
- C++ library for audio and music analysis, description and synthesis, including Python bindings☆3,558May 8, 2026Updated 2 weeks ago
- Python interface to the WebRTC Voice Activity Detector☆2,477Jul 4, 2024Updated last year
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.☆1,867Jul 22, 2025Updated 10 months ago
- kaldi-asr/kaldi is the official location of the Kaldi project.☆15,392Sep 22, 2025Updated 8 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Python audio and music signal processing library☆1,641Mar 20, 2026Updated 2 months ago
- kapre: Keras Audio Preprocessors☆945Oct 26, 2025Updated 6 months ago
- Curated list of python software and packages related to scientific research in audio☆1,692Jan 19, 2026Updated 4 months ago
- A PyTorch-based Speech Toolkit☆11,548May 13, 2026Updated last week
- CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender …☆888Mar 12, 2026Updated 2 months ago
- This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Su…☆1,588Sep 25, 2024Updated last year
- Speech recognition module for Python, supporting several engines and APIs, online and offline.☆8,964Apr 24, 2026Updated 3 weeks ago
- Audio features extraction☆248Jun 21, 2021Updated 4 years ago
- ESC-50: Dataset for Environmental Sound Classification☆1,818Mar 20, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Ras…☆26,751Jun 19, 2025Updated 11 months ago
- End-to-End Speech Processing Toolkit☆9,836May 14, 2026Updated last week
- Python examples for the course "Multimodal Information Processing & Analysis" of the MSc in Data Science in NCSR Demokritos☆98Jul 6, 2023Updated 2 years ago
- A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.☆2,270Apr 13, 2026Updated last month
- Instructional notebooks on music information retrieval.☆1,275Mar 25, 2026Updated last month
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition☆499Jul 1, 2021Updated 4 years ago
- Praat in Python, the Pythonic way☆1,260Apr 13, 2026Updated last month
- pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,…☆2,398Mar 14, 2022Updated 4 years ago
- ☆1,734Jul 25, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- SincNet is a neural architecture for efficiently processing raw audio samples.☆1,240Apr 28, 2021Updated 5 years ago
- Noise reduction in python using spectral gating (speech, bioacoustics, audio, time-domain signals)☆1,840Aug 19, 2025Updated 9 months ago
- The PyTorch-based audio source separation toolkit for researchers☆2,568May 13, 2026Updated last week
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆32,219Sep 30, 2025Updated 7 months ago
- Audio feature extraction and classification☆227Jul 6, 2023Updated 2 years ago
- Magenta: Music and Art Generation with Machine Intelligence☆19,779Jan 6, 2026Updated 4 months ago
- Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech E…☆1,892Mar 14, 2023Updated 3 years ago