Manipulate audio with a simple and easy high-level interface
☆9,744 · Jul 26, 2025 · Updated 7 months ago
Alternatives and similar repositories for pydub
Users who are interested in pydub are comparing it to the libraries listed below.
- Python library for audio and music analysis ☆8,227 · Feb 20, 2026 · Updated last week
- Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications ☆6,227 · Aug 4, 2025 · Updated 7 months ago
- Python bindings for FFmpeg - with complex filtering support ☆10,960 · Aug 4, 2024 · Updated last year
- Video editing with Python ☆14,388 · Sep 25, 2025 · Updated 5 months ago
- Speech recognition module for Python, supporting several engines and APIs, online and offline. ☆8,958 · Jan 2, 2026 · Updated 2 months ago
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker… ☆9,274 · Feb 20, 2026 · Updated last week
- End-to-End Speech Processing Toolkit ☆9,747 · Feb 26, 2026 · Updated last week
- Python interface to the WebRTC Voice Activity Detector ☆2,446 · Jul 4, 2024 · Updated last year
- A PyTorch-based Speech Toolkit ☆11,277 · Updated this week
- SoundFile is an audio library based on libsndfile, CFFI, and NumPy ☆822 · Jan 11, 2026 · Updated last month
- A Fast, Extensible Progress Bar for Python and CLI ☆30,985 · Feb 14, 2026 · Updated 2 weeks ago
- kaldi-asr/kaldi is the official location of the Kaldi project. ☆15,331 · Sep 22, 2025 · Updated 5 months ago
- Robust Speech Recognition via Large-Scale Weak Supervision ☆95,206 · Dec 15, 2025 · Updated 2 months ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python. ☆32,170 · Sep 30, 2025 · Updated 5 months ago
- 💫 Industrial-strength Natural Language Processing (NLP) in Python ☆33,254 · Nov 27, 2025 · Updated 3 months ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch ☆2,833 · Updated this week
- Audio fingerprinting and recognition in Python ☆6,725 · Apr 22, 2024 · Updated last year
- Play and Record Sound with Python ☆1,226 · Jan 23, 2026 · Updated last month
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production ☆44,691 · Aug 16, 2024 · Updated last year
- DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Ras… ☆26,736 · Jun 19, 2025 · Updated 8 months ago
- Silero VAD: pre-trained enterprise-grade Voice Activity Detector ☆8,279 · Feb 24, 2026 · Updated last week
- Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object. ☆28,140 · Updated this week
- Python packaging and dependency management made easy ☆34,286 · Updated this week
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) ☆20,368 · Feb 22, 2026 · Updated last week
- The uncompromising Python code formatter ☆41,410 · Updated this week
- This library provides common speech features for ASR including MFCCs and filterbank energies. ☆2,422 · Oct 20, 2021 · Updated 4 years ago
- FastAPI framework, high performance, easy to learn, fast to code, ready for production ☆95,805 · Updated this week
- Streamlit — A faster way to build and share data apps. ☆43,742 · Updated this week
- Deezer source separation library including pretrained models. ☆28,077 · Apr 2, 2025 · Updated 11 months ago
- A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab. ☆2,232 · Dec 27, 2025 · Updated 2 months ago
- Rich is a Python library for rich text and beautiful formatting in the terminal. ☆55,654 · Feb 26, 2026 · Updated last week
- Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work! ☆41,921 · Updated this week
- 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal model… ☆157,071 · Updated this week
- 🎛 🔊 A Python library for audio. ☆5,992 · Feb 2, 2026 · Updated last month
- Faster Whisper transcription with CTranslate2 ☆21,176 · Nov 19, 2025 · Updated 3 months ago
- Python wrapper around sox. ☆538 · Mar 26, 2025 · Updated 11 months ago
- A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Auto… ☆16,843 · Updated this week
- Python logging made (stupidly) simple ☆23,653 · Feb 22, 2026 · Updated last week
- Clone a voice in 5 seconds to generate arbitrary speech in real-time ☆59,483 · Dec 15, 2025 · Updated 2 months ago