Automated speech dataset creator
β217Jun 12, 2025Updated 8 months ago
Alternatives and similar repositories for Voice_Extractor
Users that are interested in Voice_Extractor are comparing it to the libraries listed below
Sorting:
- ποΈ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets β¨β135Aug 10, 2025Updated 6 months ago
- Open source tool for transcirption and subtitling, alternative to happyscribe.β33Feb 12, 2025Updated last year
- An AI tool designed to generate explanations for every file in a projectβ14Mar 7, 2025Updated 11 months ago
- A random walk voice style cloning application for Kokoro text to speechβ213Jun 16, 2025Updated 8 months ago
- This project allows to launch your Telegram bot in a few minutes to communicate with free or paid AI models via OpenRouter.β81Aug 21, 2025Updated 6 months ago
- Hector RAG is a modular RAG framework built on PostgreSQL, offering advanced retrieval methods and fusion techniques for AI-driven applicβ¦β60Feb 24, 2025Updated last year
- β27Jun 11, 2025Updated 8 months ago
- β13Mar 10, 2025Updated 11 months ago
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)β51Oct 18, 2024Updated last year
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. Fβ¦β289Apr 14, 2025Updated 10 months ago
- A natural language file search tool that uses LLMs to help you find files by describing what you're looking for.β27Mar 8, 2025Updated 11 months ago
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI β¦β56Feb 10, 2025Updated last year
- β17Dec 16, 2024Updated last year
- A Multi-Agentic AI Assistant/Builderβ25Jan 23, 2026Updated last month
- Realtime demo, Streaming and Finetuning code for CSMβ443Sep 17, 2025Updated 5 months ago
- Easy to use and open-source unknown stealerβ22Jul 24, 2023Updated 2 years ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.β31May 1, 2025Updated 10 months ago
- Unlimited text-to-speech in the Browser using Kokoro-JS, 100% local, 100% open sourceβ327Jun 12, 2025Updated 8 months ago
- Extract2MD is a powerful and versatile AI-enabled client-side JavaScript library for extracting text from PDF files and converting it intβ¦β104Feb 7, 2026Updated 3 weeks ago
- Simple, Efficient, and Effective Negative Guidance in Few-Step Image Generation Models By Value Sign Flipβ37Jan 27, 2026Updated last month
- Audiobook Creator is an app that converts books (EPUB, PDF, TXT etc.) into fully voiced audiobooks with intelligent character voice attriβ¦β462Nov 17, 2025Updated 3 months ago
- β29Dec 20, 2025Updated 2 months ago
- Koel Labs innovates open-source speech research, inclusive speech technologies, and real-time pronunciation feedback for language learnerβ¦β18Updated this week
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into oneβ26Aug 5, 2024Updated last year
- Deploy Apollo HF space locallyβ40Dec 16, 2024Updated last year
- FlexAudioPrint is a Python-based app for transcribing audio to text using OpenAI's Whisper model. It offers a Gradio web interface and a β¦β10Jan 29, 2026Updated last month
- β10Apr 8, 2024Updated last year
- β12May 30, 2025Updated 9 months ago
- QLoRA: Efficient Finetuning of Quantized LLMsβ11Jul 22, 2023Updated 2 years ago
- β15Apr 9, 2025Updated 10 months ago
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. β¦β13Dec 4, 2024Updated last year
- β33Jan 30, 2025Updated last year
- Create text chunks which end at natural stopping points without using a tokenizerβ26Nov 26, 2025Updated 3 months ago
- Analyze Reddit postsβ30Feb 27, 2025Updated last year
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)β30Jun 9, 2025Updated 8 months ago
- Controllable Language Model Interactions in TypeScriptβ10May 17, 2024Updated last year
- PDF reader with Google Translate embedded in SwiftUIβ13Sep 16, 2019Updated 6 years ago
- Simple Tool Caller for llama.cppβ11Aug 12, 2024Updated last year
- FINALLY: Fast and universal speech enhancement model delivering studio-quality audio for a wide range of recordings.β25Dec 11, 2025Updated 2 months ago