psdwizzard / XTTS-Read-AloudLinks
This Chrome extension integrates screen reader functionality using the XttS-webui API. Currently in beta and using the XttS Server API backend, it will soon move to AllTalk. It enhances web accessibility with seamless text-to-speech capabilities. Licensed under the MIT License for unrestricted and commercial use
☆28Updated 3 months ago
Alternatives and similar repositories for XTTS-Read-Aloud
Users that are interested in XTTS-Read-Aloud are comparing it to the libraries listed below
Sorting:
- B-Llama3o a llama3 with Vision Audio and Audio understanding as well as text and Audio and Animation Data output.☆26Updated last year
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …☆49Updated 4 months ago
- Local11Labs allows generating high-quality text-to-speech and podcast content using the fast and tiny Kokoro-82M.☆47Updated 5 months ago
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and…☆99Updated 3 months ago
- ☆50Updated 7 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆43Updated last year
- ☆47Updated 3 months ago
- Use the Moondream 2 model to detect faces and their gaze directions in videos.☆43Updated 5 months ago
- OpenPipe Reinforcement Learning Experiments☆25Updated 3 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆81Updated 8 months ago
- 100% Local Document deep search with LLMs☆26Updated 9 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆54Updated 10 months ago
- Autonomous, agentic, creative story writing system that incorporates stored embeddings and Knowledge Graphs.☆62Updated this week
- An API for VoiceCraft.☆25Updated last year
- ☆21Updated 2 months ago
- Generate visual podcasts about novels using open source models☆25Updated 2 years ago
- ☆40Updated last year
- Create text chunks which end at natural stopping points without using a tokenizer☆25Updated 3 months ago
- A random walk voice style cloning application for Kokoro text to speech☆99Updated last week
- My version of an LLM Websearch Agent using a local SearXNG server because SearXNG is great.☆36Updated 3 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆52Updated 6 months ago
- Ollama models of NousResearch/Hermes-2-Pro-Mistral-7B-GGUF☆32Updated last year
- Experimental sampler to make LLMs more creative☆31Updated last year
- Make Qwen3 Think like Gemini 2.5 Pro | Open webui function☆21Updated last month
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆23Updated 3 months ago
- run ollama & gguf easily with a single command☆51Updated last year
- Complex RAG backend☆28Updated last year
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)☆52Updated 8 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆36Updated 11 months ago
- Anthropic Computer Use with Modal Sandboxes☆36Updated 8 months ago