jack-tol / youtube-to-audio
A lightweight Python package and command-line interface (CLI) tool that extracts audio from YouTube videos and playlists in multiple formats, such as MP3, WAV, OGG, AAC, and FLAC.
☆17Updated 9 months ago
Alternatives and similar repositories for youtube-to-audio
Users who are interested in youtube-to-audio are comparing it to the repositories listed below.
- Sing an idea ➡️ AI music sample🔥🎶☆119Updated last year
- YOLOv10: Real-Time End-to-End Object Detection☆11Updated last year
- Port of Suno's Bark TTS transformer in Apple's MLX Framework☆86Updated last year
- Examples of apps built with Nendo, the AI Audio Tool Suite☆55Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆52Updated last year
- Gradio based tool to run opensource LLM models directly from Huggingface☆96Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆69Updated last month
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆60Updated last year
- The next evolution of Agents☆48Updated last week
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion☆10Updated last year
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files☆46Updated 5 months ago
- PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API☆217Updated 3 months ago
- Joint speech-language model - respond directly to audio!☆372Updated last year
- ☆314Updated 3 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆99Updated last year
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX☆29Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆105Updated 5 months ago
- [WIP] AI Try-On plugin for Chrome☆28Updated last year
- ☆37Updated 2 years ago
- ☆127Updated 8 months ago
- A python library to find differences between audio and transcriptions☆19Updated 2 years ago
- Open source video call conversational bot☆50Updated last month
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆161Updated last year
- Retrieve the source code for any model made available on replicate.com!☆36Updated last year
- Voxtral: Convert Mistral into an end2end SpeechLM. No information bottleneck, preserves prosody, learns interruptions from data. Unlike GP…☆36Updated 9 months ago
- Personalized all-purpose AI assistance platform based on hierarchical cooperative multi-agent framework which utilizes websocket connecti…☆39Updated last year
- An application that automatically generates Python code based on GPT (as used in ChatGPT).☆46Updated 2 years ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆72Updated 4 months ago
- ☆23Updated last year
- Chat to Compose Video☆197Updated last year