mateogon / pdf-narrator
Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient processing for low-resource systems.
☆82 Updated 2 months ago
Alternatives and similar repositories for pdf-narrator
Users who are interested in pdf-narrator are comparing it to the libraries listed below.
- Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities. ☆94 Updated 3 weeks ago
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA] ☆96 Updated 2 months ago
- Sesame Converse - Real Time Conversations - Powered by Gemma 3 ☆62 Updated 2 months ago
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script) ☆52 Updated 7 months ago
- EPUB, PDF, DOCX, MD, and TXT file text to speech document reader. Read documents in realtime with high-quality TTS; or extract audiobooks… ☆141 Updated last week
- ☆76 Updated 3 months ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨ ☆38 Updated last week
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F… ☆151 Updated last month
- Streaming for Chatterbox TTS ☆48 Updated this week
- A random walk voice style cloning application for Kokoro text to speech ☆85 Updated last week
- ☆46 Updated 2 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution. ☆52 Updated 5 months ago
- Python language chat with Ollama models locally, anthropic and openai ☆25 Updated last month
- Your personal and private AI ☆47 Updated 2 months ago
- ☆95 Updated last year
- 100% Local Document deep search with LLMs ☆26 Updated 8 months ago
- Self-hosted AI medical scribe. ☆31 Updated this week
- Examples of using the llasa-tts models locally ☆171 Updated last month
- Dou (道) - AI powered analysis and feedback for notes and mind maps ☆28 Updated last month
- List of curated use cases built using Sesame's CSM 1B ☆66 Updated this week
- Whisper STT + Orpheus TTS + Gemma 3 using LM Studio to create a virtual assistant. ☆55 Updated last month
- 🗣️ Real‑time, low‑latency voice, vision, and conversational‑memory AI assistant built on LiveKit and local LLMs ✨ ☆35 Updated last week
- A local implementation of the Kokoro Text-to-Speech model, featuring dynamic module loading, automatic dependency management, and a web i… ☆179 Updated last week
- Orpheus Chat WebUI ☆62 Updated 2 months ago
- Local & Private LLM that drafts responses LIKE you automatically ☆80 Updated 6 months ago
- A Conversational Speech Generation Model with Gradio UI and OpenAI compatible API. UI and API support CUDA, MLX and CPU devices. ☆186 Updated 3 weeks ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around… ☆53 Updated 9 months ago
- ☆67 Updated 2 months ago
- OpenAI compatible API for Dia-1.6B ☆29 Updated last month
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆30 Updated last month