jiaaro / pydub
Manipulate audio with a simple and easy high level interface
☆9,727 · Jul 26, 2025 · Updated 6 months ago
Alternatives and similar repositories for pydub
Users who are interested in pydub compare it to the libraries listed below.
- Python library for audio and music analysis ☆8,186 · Feb 5, 2026 · Updated last week
- Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications ☆6,216 · Aug 4, 2025 · Updated 6 months ago
- Python bindings for FFmpeg - with complex filtering support ☆10,946 · Aug 4, 2024 · Updated last year
- Video editing with Python ☆14,333 · Sep 25, 2025 · Updated 4 months ago
- Speech recognition module for Python, supporting several engines and APIs, online and offline. ☆8,941 · Jan 2, 2026 · Updated last month
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker… ☆9,159 · Updated this week
- End-to-End Speech Processing Toolkit ☆9,722 · Feb 5, 2026 · Updated last week
- Python interface to the WebRTC Voice Activity Detector ☆2,443 · Jul 4, 2024 · Updated last year
- A PyTorch-based Speech Toolkit ☆11,203 · Updated this week
- SoundFile is an audio library based on libsndfile, CFFI, and NumPy ☆816 · Jan 11, 2026 · Updated last month
- A Fast, Extensible Progress Bar for Python and CLI ☆30,948 · Feb 4, 2026 · Updated last week
- kaldi-asr/kaldi is the official location of the Kaldi project. ☆15,322 · Sep 22, 2025 · Updated 4 months ago
- Robust Speech Recognition via Large-Scale Weak Supervision ☆94,315 · Dec 15, 2025 · Updated 2 months ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python. ☆32,143 · Sep 30, 2025 · Updated 4 months ago
- 💫 Industrial-strength Natural Language Processing (NLP) in Python ☆33,201 · Nov 27, 2025 · Updated 2 months ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch ☆2,823 · Feb 8, 2026 · Updated last week
- Audio fingerprinting and recognition in Python ☆6,722 · Apr 22, 2024 · Updated last year
- Play and Record Sound with Python ☆1,221 · Jan 23, 2026 · Updated 3 weeks ago
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production ☆44,516 · Aug 16, 2024 · Updated last year
- DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Ras… ☆26,726 · Jun 19, 2025 · Updated 7 months ago
- Silero VAD: pre-trained enterprise-grade Voice Activity Detector ☆8,125 · Dec 30, 2025 · Updated last month
- Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object. ☆28,108 · Feb 1, 2026 · Updated 2 weeks ago
- Python packaging and dependency management made easy ☆34,199 · Updated this week
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) ☆20,051 · Updated this week
- The uncompromising Python code formatter ☆41,376 · Feb 6, 2026 · Updated last week
- This library provides common speech features for ASR including MFCCs and filterbank energies. ☆2,421 · Oct 20, 2021 · Updated 4 years ago
- FastAPI framework, high performance, easy to learn, fast to code, ready for production ☆95,033 · Updated this week
- Deezer source separation library including pretrained models. ☆28,033 · Apr 2, 2025 · Updated 10 months ago
- Streamlit — A faster way to build and share data apps. ☆43,477 · Updated this week
- A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab. ☆2,230 · Dec 27, 2025 · Updated last month
- Rich is a Python library for rich text and beautiful formatting in the terminal. ☆55,429 · Feb 1, 2026 · Updated 2 weeks ago
- Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work! ☆41,698 · Updated this week
- 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal model… ☆156,440 · Updated this week
- 🎛 🔊 A Python library for audio. ☆5,964 · Feb 2, 2026 · Updated last week
- Faster Whisper transcription with CTranslate2 ☆20,833 · Nov 19, 2025 · Updated 2 months ago
- Python wrapper around sox. ☆536 · Mar 26, 2025 · Updated 10 months ago
- A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Auto… ☆16,738 · Updated this week
- Python logging made (stupidly) simple ☆23,584 · Jan 15, 2026 · Updated last month
- Clone a voice in 5 seconds to generate arbitrary speech in real-time ☆59,336 · Dec 15, 2025 · Updated 2 months ago