Automated speech dataset creator
β220Jun 12, 2025Updated 10 months ago
Alternatives and similar repositories for Voice_Extractor
Users that are interested in Voice_Extractor are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ποΈ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasetsβ136Aug 10, 2025Updated 8 months ago
- Open source tool for transcirption and subtitling, alternative to happyscribe.β34Feb 12, 2025Updated last year
- β27Jun 11, 2025Updated 10 months ago
- β13Mar 10, 2025Updated last year
- A GTK4-based text-to-speech and AI assistant app in Rust, featuring PDF reading and LLM chat powered by Kokoro TTSβ20Feb 20, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- An AI tool designed to generate explanations for every file in a projectβ14Mar 7, 2025Updated last year
- Local modular AI assistant with speech, vision, and robotics support. Uses Qwen3-VL-4B in LM Studio.β53Jan 9, 2026Updated 3 months ago
- Deploy Apollo HF space locallyβ40Dec 16, 2024Updated last year
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)β51Oct 18, 2024Updated last year
- A random walk voice style cloning application for Kokoro text to speechβ240Apr 6, 2026Updated 3 weeks ago
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI β¦β57Feb 10, 2025Updated last year
- A TTS model capable of generating ultra-realistic dialogue in one pass.β31May 1, 2025Updated last year
- Simple Tool Caller for llama.cppβ11Aug 12, 2024Updated last year
- Hector RAG is a modular RAG framework built on PostgreSQL, offering advanced retrieval methods and fusion techniques for AI-driven applicβ¦β60Feb 24, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Extract2MD is a powerful and versatile AI-enabled client-side JavaScript library for extracting text from PDF files and converting it intβ¦β107Apr 7, 2026Updated 3 weeks ago
- A natural language file search tool that uses LLMs to help you find files by describing what you're looking for.β27Mar 8, 2025Updated last year
- Analyze Reddit postsβ31Feb 27, 2025Updated last year
- Unlimited text-to-speech in the Browser using Kokoro-JS, 100% local, 100% open sourceβ335Jun 12, 2025Updated 10 months ago
- Audiobook Creator is an app that converts books (EPUB, PDF, TXT etc.) into fully voiced audiobooks with intelligent character voice attriβ¦β477Nov 17, 2025Updated 5 months ago
- The most feature-complete local AI workstation. Multi-GPU inference, integrated Stable Diffusion + ADetailer, voice cloning, research-graβ¦β59Feb 24, 2026Updated 2 months ago
- Simple, Efficient, and Effective Negative Guidance in Few-Step Image Generation Models By Value Sign Flipβ37Jan 27, 2026Updated 3 months ago
- A functioning Sesame CSM project with a desktop GUI - Real-time factor: 0.6x with 4070 Ti Super - Requires only 8GB VRAMβ80May 19, 2025Updated 11 months ago
- A Multi-Agentic AI Assistant/Builderβ26Updated this week
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. Fβ¦β297Apr 14, 2025Updated last year
- β17Dec 16, 2024Updated last year
- β15Mar 18, 2026Updated last month
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. β¦β13Dec 4, 2024Updated last year
- Realtime demo, Streaming and Finetuning code for CSMβ454Sep 17, 2025Updated 7 months ago
- QLoRA: Efficient Finetuning of Quantized LLMsβ11Jul 22, 2023Updated 2 years ago
- ComfyUI workflows to create smooth transitions between video clips using Wan VACE. Works with video from any model or other source-LTX-2,β¦β80Apr 11, 2026Updated 2 weeks ago
- The rag pipeline for optimizing dynamic data editing.β21Oct 30, 2025Updated 6 months ago
- Run Orpheus 3B Locally With LM Studioβ32Mar 20, 2025Updated last year
- Proton VPN Special Offer - Get 70% off β’ AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- β33Jan 30, 2025Updated last year
- FlexAudioPrint is a Python-based app for transcribing audio to text using OpenAI's Whisper model. It offers a Gradio web interface and a β¦β10Apr 22, 2026Updated last week
- Create text chunks which end at natural stopping points without using a tokenizerβ26Nov 26, 2025Updated 5 months ago
- A transformers implementation of csm-streamingβ30May 16, 2025Updated 11 months ago
- β137May 2, 2025Updated 11 months ago
- An AI character interaction system with emotional modeling and advanced memory managementβ17Oct 26, 2024Updated last year
- Authenticated independently verifiable agent delegation.β33Dec 17, 2025Updated 4 months ago