jack-tol / youtube-to-audioLinks
A lightweight Python package and command-line interface (CLI) tool that extracts audio from YouTube videos and playlists in multiple formats, such as MP3, WAV, OGG, AAC, and FLAC.
☆13Updated 5 months ago
Alternatives and similar repositories for youtube-to-audio
Users that are interested in youtube-to-audio are comparing it to the libraries listed below
Sorting:
- Sing an idea ➡️ AI music sample🔥🎶☆116Updated last year
- PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API☆216Updated 2 weeks ago
- Efficient approach to speaker diarization using voice characteristics extraction☆99Updated 2 months ago
- ☆83Updated last week
- [WIP] AI Try-On plugin for Chrome☆27Updated last year
- Video+code lecture on building nanoGPT from scratch☆69Updated last year
- Joint speech-language model - respond directly to audio!☆371Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆96Updated last year
- ☆127Updated 5 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆159Updated last year
- ☆37Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆67Updated last month
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆136Updated last year
- Simli WebRTC AI Agent demo☆23Updated 9 months ago
- Chat to Compose Video☆193Updated last year
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆183Updated 4 months ago
- Examples of apps built with Nendo, the AI Audio Tool Suite☆55Updated last year
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files☆47Updated 2 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆54Updated 8 months ago
- ☆262Updated last year
- YOLOv10: Real-Time End-to-End Object Detection☆11Updated last year
- A python library to find differences between audio and transcriptions☆20Updated last year
- Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.☆23Updated 5 months ago
- The next evolution of Agents☆47Updated 2 weeks ago
- Collection of Open Source Speech Data☆159Updated 9 months ago
- Cog wrapper for collabora/WhisperSpeech☆25Updated last year
- ☆157Updated 2 years ago
- ☆24Updated last year
- Open-source AI for voice control, rivaling Alexa and Siri☆12Updated last year
- ☆62Updated last year