mateogon / pdf-narrator
Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient processing for low-resource systems.
☆53Updated last week
Alternatives and similar repositories for pdf-narrator:
Users who are interested in pdf-narrator are comparing it to the libraries listed below.
- ☆46Updated 4 months ago
- High level tool use for LLMs☆34Updated 7 months ago
- Text generation in Python, as easy as possible☆55Updated last week
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆51Updated 3 months ago
- An API for VoiceCraft.☆25Updated 8 months ago
- My version of an LLM Websearch Agent using a local SearXNG server because SearXNG is great.☆26Updated 2 weeks ago
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files☆45Updated 2 weeks ago
- ☆91Updated 2 months ago
- B-Llama3o: a llama3 with vision and audio understanding, as well as text, audio, and animation data output.☆26Updated 9 months ago
- Choose a topic, a music genre and wait for the agents to generate a song☆53Updated 8 months ago
- Mycomind Daemon: A mycelium-inspired, advanced Mixture-of-Memory-RAG-Agents (MoMRA) cognitive assistant that combines multiple AI models …☆31Updated 8 months ago
- SPLAA is an AI assistant framework that utilizes voice recognition, text-to-speech, and tool-calling capabilities to provide a conversati…☆25Updated 2 months ago
- Experience the power of AI with this free AI voice generator demo. Utilizing Deepgram and Groq, we transform text into voice seamlessly. …☆37Updated 9 months ago
- Run Ollama LLM models in Google Colab for free☆33Updated 3 months ago
- Local11Labs allows generating high-quality text-to-speech and podcast content using the fast and tiny Kokoro-82M.☆46Updated 2 months ago
- The heart of The Pulsar App: fast, secure, and shared inference with a modern UI☆56Updated 3 months ago
- ☆17Updated 3 months ago
- Crow is a Desktop AI Assistant☆32Updated 7 months ago
- 100% Local Document deep search with LLMs☆26Updated 6 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆17Updated 5 months ago
- Create text chunks which end at natural stopping points without using a tokenizer☆26Updated this week
- Gradio-based tool to run open-source LLM models directly from Hugging Face☆91Updated 8 months ago
- Local & Private LLM that drafts responses LIKE you automatically☆77Updated 4 months ago
- ☆58Updated 6 months ago
- WhisperAnywhere: Effortless speech-to-text everywhere on your Mac. Use a hotkey to dictate in any app, powered by Whisper AI and Groq API…☆25Updated 6 months ago
- The next evolution of Agents☆48Updated last month