Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
☆6,235 · Aug 4, 2025 · Updated 8 months ago
Alternatives and similar repositories for pyAudioAnalysis
Users that are interested in pyAudioAnalysis are comparing it to the libraries listed below.
- Python AUdio Recording and Analysis (paura) ☆226 · Jul 6, 2023 · Updated 2 years ago
- Python library for audio and music analysis ☆8,296 · Mar 24, 2026 · Updated 2 weeks ago
- Manipulate audio with a simple and easy high level interface ☆9,749 · Mar 19, 2026 · Updated 3 weeks ago
- This library provides common speech features for ASR including MFCCs and filterbank energies. ☆2,423 · Oct 20, 2021 · Updated 4 years ago
- An audio/acoustic activity detection and audio segmentation tool ☆845 · Updated this week
- Data manipulation and transformation for audio signal processing, powered by PyTorch ☆2,862 · Updated this week
- Pytorch implementation of deep audio embedding calculation ☆106 · Jul 23, 2023 · Updated 2 years ago
- Audio fingerprinting and recognition in Python ☆6,741 · Apr 22, 2024 · Updated last year
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker… ☆9,652 · Apr 1, 2026 · Updated last week
- a library for audio and music analysis ☆3,681 · Nov 20, 2025 · Updated 4 months ago
- C++ library for audio and music analysis, description and synthesis, including Python bindings ☆3,497 · Feb 9, 2026 · Updated 2 months ago
- Python interface to the WebRTC Voice Activity Detector ☆2,455 · Jul 4, 2024 · Updated last year
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources. ☆1,858 · Jul 22, 2025 · Updated 8 months ago
- kaldi-asr/kaldi is the official location of the Kaldi project. ☆15,367 · Sep 22, 2025 · Updated 6 months ago
- Python audio and music signal processing library ☆1,617 · Mar 20, 2026 · Updated 3 weeks ago
- kapre: Keras Audio Preprocessors ☆946 · Oct 26, 2025 · Updated 5 months ago
- Curated list of python software and packages related to scientific research in audio ☆1,685 · Jan 19, 2026 · Updated 2 months ago
- A PyTorch-based Speech Toolkit ☆11,422 · Apr 3, 2026 · Updated last week
- CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender … ☆879 · Mar 12, 2026 · Updated 3 weeks ago
- This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Su… ☆1,588 · Sep 25, 2024 · Updated last year
- Speech recognition module for Python, supporting several engines and APIs, online and offline. ☆8,964 · Mar 29, 2026 · Updated last week
- Audio features extraction ☆248 · Jun 21, 2021 · Updated 4 years ago
- ESC-50: Dataset for Environmental Sound Classification ☆1,787 · Mar 20, 2024 · Updated 2 years ago
- DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Ras… ☆26,752 · Jun 19, 2025 · Updated 9 months ago
- End-to-End Speech Processing Toolkit ☆9,801 · Updated this week
- Python examples for the course "Multimodal Information Processing & Analysis" of the MSc in Data Science in NCSR Demokritos ☆98 · Jul 6, 2023 · Updated 2 years ago
- A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab. ☆2,251 · Dec 27, 2025 · Updated 3 months ago
- Instructional notebooks on music information retrieval. ☆1,271 · Mar 25, 2026 · Updated 2 weeks ago
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition ☆498 · Jul 1, 2021 · Updated 4 years ago
- Praat in Python, the Pythonic way ☆1,246 · Mar 20, 2026 · Updated 3 weeks ago
- pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,… ☆2,398 · Mar 14, 2022 · Updated 4 years ago
- ☆1,702 · Jul 25, 2024 · Updated last year
- SincNet is a neural architecture for efficiently processing raw audio samples. ☆1,238 · Apr 28, 2021 · Updated 4 years ago
- Noise reduction in python using spectral gating (speech, bioacoustics, audio, time-domain signals) ☆1,829 · Aug 19, 2025 · Updated 7 months ago
- The PyTorch-based audio source separation toolkit for researchers ☆2,556 · Oct 6, 2025 · Updated 6 months ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python. ☆32,201 · Sep 30, 2025 · Updated 6 months ago
- Audio feature extraction and classification ☆227 · Jul 6, 2023 · Updated 2 years ago
- Magenta: Music and Art Generation with Machine Intelligence ☆19,768 · Jan 6, 2026 · Updated 3 months ago
- Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020) We provide a PyTorch implementation of the paper Real Time Speech E… ☆1,882 · Mar 14, 2023 · Updated 3 years ago