mateogon/pdf-narrator

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mateogon/pdf-narrator)

mateogon / pdf-narrator

Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient processing for low-resource systems.

☆198

Alternatives and similar repositories for pdf-narrator

Users that are interested in pdf-narrator are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MoAshour93 / Construction_Convert_Books_to_Podcasts
View on GitHub
Transform your PDFs into captivating audio podcasts with this PDF-to-Podcast pipeline! Combining advanced language models and high-qualit…
☆17Nov 11, 2024Updated last year
richardr1126 / openreader
View on GitHub
An open-source read-along document reader server with high-quality TTS options, synchronized highlighting, and audiobook export for EPUB,…
☆483Jul 22, 2026Updated last week
JarodMica / F5-TTS
View on GitHub
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
☆54Apr 13, 2026Updated 3 months ago
im-knots / the-academy
View on GitHub
A Socratic dialogue engine for AI agents.
☆15Nov 30, 2025Updated 7 months ago
nazdridoy / kokoro-tts
View on GitHub
A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats includ…
☆1,729Apr 8, 2026Updated 3 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
JarodMica / StyleTTS-ZS
View on GitHub
StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion
☆10Sep 22, 2024Updated last year
duerig / StyleTTS2
View on GitHub
StyleTTS 2 Optimized Training Fork
☆32Feb 2, 2025Updated last year
xiaolaa2 / midi-file-mcp
View on GitHub
A powerful MCP tool for parsing and manipulating MIDI files based on Tone.js. This library leverages the Model Context Protocol (MCP) to …
☆11May 9, 2025Updated last year
justinlime / Fatterbox
View on GitHub
Open API and Wyoming wrapper around Chatterbox
☆28Jan 2, 2026Updated 6 months ago
watercrawl / self-hosted
View on GitHub
A self-hosted version of WaterCrawl, a powerful web crawling and data extraction platform.
☆13Jul 27, 2025Updated last year
EveryVoiceTTS / EveryVoice
View on GitHub
The EveryVoice TTS Toolkit - Text To Speech for your language
☆43Updated this week
huangruizhe / audio
View on GitHub
Data manipulation and transformation for audio signal processing, powered by PyTorch
☆10Sep 30, 2024Updated last year
1rvinn / mindpalace
View on GitHub
find it hard to understand long github repos and pdfs? struggle no more, just enter your mindpalace. mindpalace helps you understand the …
☆15Aug 27, 2025Updated 11 months ago
jhuang448 / MultilingualALT
View on GitHub
Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""
☆15Jun 28, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
lucasjinreal / Kokoros
View on GitHub
🔥🔥 Kokoro in Rust. https://huggingface.co/hexgrad/Kokoro-82M Insanely fast, realtime TTS with high quality you ever have.
☆803Jun 19, 2026Updated last month
llm-lab-org / CLASP
View on GitHub
CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval
☆13Jun 27, 2025Updated last year
The-Swarm-Corporation / OmniParse
View on GitHub
Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …
☆20Oct 13, 2025Updated 9 months ago
vanzan01 / Claude-Code-MindPalace
View on GitHub
Never write commit messages again. Auto-checkpoint every change, auto-squash into professional git history. For Claude Code.
☆17Jul 19, 2025Updated last year
NeuralVox / StyleTTS2
View on GitHub
☆98Apr 27, 2024Updated 2 years ago
TigreGotico / chatterbox-onnx
View on GitHub
chatterbox TTS + Voice Clone using onnx
☆28Jul 20, 2026Updated last week
cpuimage / EqualLoudness
View on GitHub
Equal Loudness Filter
☆11Mar 4, 2019Updated 7 years ago
SpaghettiFibonacci / clarity
View on GitHub
Perplexity.ai clone recipe for a DIY api or ChatGPT citing sources
☆10Mar 14, 2023Updated 3 years ago
reppy4620 / convnext_tts
View on GitHub
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆18Oct 20, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Deveraux-Parker / Nvidia_parakeet-tdt-0.6b-v2-FAST-BATCHING-API-1200x-RTFx
View on GitHub
☆43Oct 9, 2025Updated 9 months ago
ArenAcikgoz / Whisper-Alignment
View on GitHub
Forced alignment decoder for Whisper.
☆16Mar 13, 2024Updated 2 years ago
prakharsr / audiobook-creator
View on GitHub
Audiobook Creator is an app that converts books (EPUB, PDF, TXT etc.) into fully voiced audiobooks with intelligent character voice attri…
☆517Nov 17, 2025Updated 8 months ago
hexgrad / kokoro
View on GitHub
https://hf.co/hexgrad/Kokoro-82M
☆8,149Aug 6, 2025Updated 11 months ago
bvhari / ComfyUI_SUNoise
View on GitHub
Scaled Uniform Noise for Ancestral & Stochastic samplers and Noisy latent image
☆17Mar 30, 2025Updated last year
cpttripzz / Chatterblez
View on GitHub
Generate audiobooks from pdf or epub using Next-gen AI Chatterbox-tts from Resemble-AI
☆20Apr 26, 2026Updated 3 months ago
Agora-Lab-AI / OmegaViT
View on GitHub
OmegaViT (ΩViT) is a cutting-edge vision transformer architecture that combines multi-query attention, rotary embeddings, state space mod…
☆15Updated this week
JSideris / simple-compute-shaders
View on GitHub
☆15Feb 2, 2026Updated 5 months ago
DivergerThinking / llm_messaging_apps
View on GitHub
Demo repository for creating a custom chatbot powered by LLMs for Telegram and Whatsapp.
☆15Jan 18, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Cam10001110101 / mcp-server-ollama-deep-researcher
View on GitHub
⛔ ARCHIVED — migrated to mcpcentral-io/mcpcentral apps/deep-researcher (ADR-043, 2026-07-23)
☆16Updated this week
thewh1teagle / kokoro-onnx
View on GitHub
TTS with kokoro and onnx runtime
☆2,647Jul 5, 2026Updated 3 weeks ago
aqtq314 / VogenSVS
View on GitHub
☆15Apr 16, 2026Updated 3 months ago
cdeck3r / 3DScanner
View on GitHub
Software for a person-sized, full body DIY Raspberry Pi based 3D Scanner
☆12Oct 7, 2023Updated 2 years ago
aholab / AhoTTS
View on GitHub
Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its…
☆18Jan 15, 2026Updated 6 months ago
dirmacs / ares
View on GitHub
Agentic AI server in Rust. Multi-provider LLM routing, tool calling, RAG, MCP, multi-tenant workflows.
☆16Jun 18, 2026Updated last month
wonjune-kang / expressive-speech-retrieval
View on GitHub
Expressive Speech Retrieval using Natural Language Descriptions of Speaking Style
☆15Aug 18, 2025Updated 11 months ago