jack-tol / youtube-to-audio
A lightweight Python package and command-line interface (CLI) tool that extracts audio from YouTube videos and playlists in multiple formats, such as MP3, WAV, OGG, AAC, and FLAC.
☆12 Updated 3 weeks ago
Alternatives and similar repositories for youtube-to-audio:
Users who are interested in youtube-to-audio are comparing it to the libraries listed below:
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆61 Updated 2 weeks ago
- Joint speech-language model - respond directly to audio! ☆30 Updated 10 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution. ☆53 Updated 3 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆94 Updated 10 months ago
- ☆62 Updated 8 months ago
- Easily create a dataset from a list of YouTube links ☆17 Updated last year
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆57 Updated 11 months ago
- Sing an idea ➡️ AI music sample🔥🎶 ☆102 Updated 11 months ago
- Google's SoundStorm: Efficient Parallel Audio Generation ☆131 Updated last year
- ☆39 Updated last week
- Create an LJSpeech structured voice dataset on wave input ☆27 Updated 6 months ago
- Efficient approach to speaker diarization using voice characteristics extraction ☆93 Updated 11 months ago
- Agentic RAG to help you build a startup🚀 ☆16 Updated 3 weeks ago
- Open TTS models, built for streaming on the edge ☆39 Updated 2 weeks ago
- ☆107 Updated last year
- Simli WebRTC AI Agent demo ☆20 Updated 3 months ago
- A quick and optimized solution to manage llama-based gguf quantized models, download gguf files, retrieve message formatting, add more mo… ☆12 Updated last year
- Use quantized versions of Whisper to speed up inference ☆12 Updated 5 months ago
- An open source real-time AI inference engine for seamless scaling ☆17 Updated 2 weeks ago
- Video+code lecture on building nanoGPT from scratch ☆66 Updated 9 months ago
- ☆37 Updated last year
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration ☆157 Updated last week
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management. ☆22 Updated 6 months ago
- Unsloth Studio ☆74 Updated 3 weeks ago
- ☆254 Updated last year
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks. ☆46 Updated last month
- ☆17 Updated 2 years ago
- Faster Tortoise inference than Tortoise Fast Fork ☆128 Updated 11 months ago
- A simple system for 2-way interruptible voice interactions between human and LLM ☆25 Updated last year
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files ☆45 Updated last month