Automated speech dataset creator
☆218Jun 12, 2025Updated 9 months ago
Alternatives and similar repositories for Voice_Extractor
Users that are interested in Voice_Extractor are comparing it to the libraries listed below
Sorting:
- Open source tool for transcirption and subtitling, alternative to happyscribe.☆34Feb 12, 2025Updated last year
- ☆27Jun 11, 2025Updated 9 months ago
- ☆13Mar 10, 2025Updated last year
- A GTK4-based text-to-speech and AI assistant app in Rust, featuring PDF reading and LLM chat powered by Kokoro TTS☆20Feb 20, 2026Updated last month
- This project allows to launch your Telegram bot in a few minutes to communicate with free or paid AI models via OpenRouter.☆83Aug 21, 2025Updated 7 months ago
- An AI tool designed to generate explanations for every file in a project☆14Mar 7, 2025Updated last year
- Local modular AI assistant with speech, vision, and robotics support. Uses Qwen3-VL-4B in LM Studio.☆52Jan 9, 2026Updated 2 months ago
- Deploy Apollo HF space locally☆40Dec 16, 2024Updated last year
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)☆51Oct 18, 2024Updated last year
- A random walk voice style cloning application for Kokoro text to speech☆216Jun 16, 2025Updated 9 months ago
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …☆56Feb 10, 2025Updated last year
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆31May 1, 2025Updated 10 months ago
- Simple Tool Caller for llama.cpp☆11Aug 12, 2024Updated last year
- Extract2MD is a powerful and versatile AI-enabled client-side JavaScript library for extracting text from PDF files and converting it int…☆105Feb 7, 2026Updated last month
- Analyze Reddit posts☆30Feb 27, 2025Updated last year
- A natural language file search tool that uses LLMs to help you find files by describing what you're looking for.☆27Mar 8, 2025Updated last year
- Audiobook Creator is an app that converts books (EPUB, PDF, TXT etc.) into fully voiced audiobooks with intelligent character voice attri…☆465Nov 17, 2025Updated 4 months ago
- Unlimited text-to-speech in the Browser using Kokoro-JS, 100% local, 100% open source☆330Jun 12, 2025Updated 9 months ago
- The most feature-complete local AI workstation. Multi-GPU inference, integrated Stable Diffusion + ADetailer, voice cloning, research-gra…☆57Feb 24, 2026Updated 3 weeks ago
- A utility that uses Whisper to transcribe videos and various translation APIs to translate the transcribed text and save them as SRT (sub…☆74Aug 30, 2024Updated last year
- Simple, Efficient, and Effective Negative Guidance in Few-Step Image Generation Models By Value Sign Flip☆37Jan 27, 2026Updated last month
- A functioning Sesame CSM project with a desktop GUI - Real-time factor: 0.6x with 4070 Ti Super - Requires only 8GB VRAM☆77May 19, 2025Updated 10 months ago
- A Multi-Agentic AI Assistant/Builder☆25Jan 23, 2026Updated last month
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…☆293Apr 14, 2025Updated 11 months ago
- Realtime demo, Streaming and Finetuning code for CSM☆446Sep 17, 2025Updated 6 months ago
- ☆15Updated this week
- ☆17Dec 16, 2024Updated last year
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- The rag pipeline for optimizing dynamic data editing.☆21Oct 30, 2025Updated 4 months ago
- Run Orpheus 3B Locally With LM Studio☆32Mar 20, 2025Updated last year
- ☆33Jan 30, 2025Updated last year
- FlexAudioPrint is a Python-based app for transcribing audio to text using OpenAI's Whisper model. It offers a Gradio web interface and a …☆10Jan 29, 2026Updated last month
- ComfyUI custom node to pause a workflow☆33May 5, 2025Updated 10 months ago
- A transformers implementation of csm-streaming☆28May 16, 2025Updated 10 months ago
- Create text chunks which end at natural stopping points without using a tokenizer☆26Nov 26, 2025Updated 3 months ago
- ComfyUI custom node implementation of VideoMaMa for video matting with mask conditioning.☆46Feb 9, 2026Updated last month
- ☆135May 2, 2025Updated 10 months ago
- An AI character interaction system with emotional modeling and advanced memory management☆17Oct 26, 2024Updated last year