jack-tol / youtube-to-audio
A lightweight Python package and command-line interface (CLI) tool that extracts audio from YouTube videos and playlists in multiple formats, such as MP3, WAV, OGG, AAC, and FLAC.
☆18 Updated 10 months ago
Alternatives and similar repositories for youtube-to-audio
Users who are interested in youtube-to-audio are comparing it to the libraries listed below.
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution. ☆53 Updated last year
- Gradio-based tool to run open-source LLM models directly from Hugging Face ☆96 Updated last year
- PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API ☆219 Updated 3 weeks ago
- ☆127 Updated 10 months ago
- Sing an idea ➡️ AI music sample🔥🎶 ☆119 Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction ☆106 Updated 7 months ago
- A set of tools to download your music from Suno.ai with organized filenames and prompts. ☆22 Updated last year
- YOLOv10: Real-Time End-to-End Object Detection ☆11 Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆70 Updated 3 months ago
- Simli WebRTC AI Agent demo ☆24 Updated last year
- Sesame Converse - Real Time Conversations - Powered by Gemma 3 ☆64 Updated 10 months ago
- Examples of apps built with Nendo, the AI Audio Tool Suite ☆55 Updated last year
- Simulates talk with an AI that can express emotions ☆82 Updated 7 months ago
- This is a Raspberry Pi 5 whisper C++ voice assistant - backwards compatible with Pi4 ☆24 Updated 2 years ago
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files ☆46 Updated 7 months ago
- A unified library for interacting with various AI APIs through a standardized interface. ☆31 Updated 10 months ago
- A highly optimized engine for neutts-air model to generate minutes of audio in seconds. Over 200x realtime on modern hardware! ☆108 Updated 2 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ☆161 Updated last year
- Port of Suno's Bark TTS transformer in Apple's MLX Framework ☆86 Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre… ☆22 Updated last year
- Cog wrapper for collabora/WhisperSpeech ☆25 Updated last year
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆60 Updated last year
- Local11Labs allows generating high-quality text-to-speech and podcast content using the fast and tiny Kokoro-82M. ☆48 Updated last year
- Your personal and private AI ☆55 Updated 9 months ago
- Chat to Compose Video ☆198 Updated 2 years ago
- ☆37 Updated 2 years ago
- ☆24 Updated last year
- ☆83 Updated last year
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆82 Updated last year
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio. ☆141 Updated last year