mateogon / pdf-narrator
Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient processing for low-resource systems.
☆62Updated 2 weeks ago
Alternatives and similar repositories for pdf-narrator:
Users that are interested in pdf-narrator are comparing it to the libraries listed below
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆53Updated 4 months ago
- A cutting-edge Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.☆56Updated last week
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆22Updated 2 weeks ago
- Use smol agents to do research and then update csv coumns with its findings.☆37Updated 2 months ago
- 100% Local Document deep search with LLMs☆26Updated 7 months ago
- ☆91Updated 2 months ago
- ☆45Updated last month
- ☆24Updated 2 months ago
- Streaming and Finetuning code for CSM☆67Updated this week
- This repo provides a simple Gradio UI to run Qwen2 VL 72B AWQ in venv and have both image and video inferencing work.☆30Updated 6 months ago
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)☆52Updated 5 months ago
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files☆45Updated last month
- Agentic RAG to help you build a startup🚀☆19Updated last week
- Sesame Converse - Real Time Conversations - Powered by Gemma 3☆60Updated 3 weeks ago
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆56Updated 4 months ago
- Create text chunks which end at natural stopping points without using a tokenizer☆26Updated 3 weeks ago
- Local & Private LLM that drafts responses LIKE you automatically☆78Updated 4 months ago
- ☆47Updated 5 months ago
- This Chrome extension integrates screen reader functionality using the XttS-webui API. Currently in beta and using the XttS Server API ba…☆23Updated 3 weeks ago
- Run Ollama LLM models in Google Colab for free☆33Updated 4 months ago
- Deploy Apollo HF space locally☆40Updated 3 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆54Updated 7 months ago
- SPLAA is an AI assistant framework that utilizes voice recognition, text-to-speech, and tool-calling capabilities to provide a conversati…☆25Updated 3 months ago
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆30Updated this week
- Automated LLM novelist☆44Updated last year
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA]☆73Updated 3 weeks ago
- Self-hosted Ollama + Whisper powered AI medical scribe.☆22Updated this week
- Screenshot LLM is a Python application that leverages the power of AI to analyze screenshots. Built with PyQt6 for a user-friendly interf…☆40Updated 5 months ago
- List of curated use cases built using Sesame's CSM 1B☆58Updated 3 weeks ago
- ☆31Updated last month