Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient processing for low-resource systems.
☆185 · Feb 26, 2026 · Updated 3 months ago
Alternatives and similar repositories for pdf-narrator
Users that are interested in pdf-narrator are comparing it to the libraries listed below.
- Transform your PDFs into captivating audio podcasts with this PDF-to-Podcast pipeline! Combining advanced language models and high-qualit… ☆17 · Nov 11, 2024 · Updated last year
- A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats includ… ☆1,582 · Apr 8, 2026 · Updated 2 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆54 · Apr 13, 2026 · Updated 2 months ago
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion ☆10 · Sep 22, 2024 · Updated last year
- StyleTTS 2 Optimized Training Fork ☆32 · Feb 2, 2025 · Updated last year
- epub2tts-kokoro is a free and open source python app to easily create a full-featured audiobook from an epub or text file using realistic… ☆35 · Feb 14, 2026 · Updated 4 months ago
- Code repository for TIDMAD: Time series Dataset for Discovering Dark Matter with AI Denoising. ☆16 · Apr 1, 2026 · Updated 2 months ago
- OmniByteFormer is a generalized Transformer model that can process any type of data by converting it into byte sequences, bypassing tradi… ☆16 · May 25, 2026 · Updated 3 weeks ago
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model" ☆15 · Jun 28, 2024 · Updated last year
- Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/multiplatform CPU, AMD, NVIDIA GPU PyTorch support, handling, and auto-s… ☆4,992 · Jun 6, 2026 · Updated last week
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale … ☆20 · Oct 13, 2025 · Updated 8 months ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval ☆13 · Jun 27, 2025 · Updated 11 months ago
- 🔥🔥 Kokoro in Rust. https://huggingface.co/hexgrad/Kokoro-82M Insanely fast, realtime TTS with high quality you ever have. ☆786 · Jun 1, 2026 · Updated 2 weeks ago
- Update script for Manjaro ☆11 · Aug 9, 2024 · Updated last year
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀 ☆21 · May 20, 2025 · Updated last year
- Unofficial implementation of ConvNeXt-TTS powered by lightning ☆18 · Oct 20, 2024 · Updated last year
- Forced alignment decoder for Whisper. ☆16 · Mar 13, 2024 · Updated 2 years ago
- Little utilities for Aegisub that make my life easier ☆14 · Aug 6, 2025 · Updated 10 months ago
- Project of Singing Voice Conversion. ☆16 · Oct 27, 2023 · Updated 2 years ago
- Equal Loudness Filter ☆11 · Mar 4, 2019 · Updated 7 years ago
- Scaled Uniform Noise for Ancestral & Stochastic samplers and Noisy latent image ☆17 · Mar 30, 2025 · Updated last year
- TTS with kokoro and onnx runtime ☆2,583 · Jan 30, 2026 · Updated 4 months ago
- Text-to-Speech converter for Basque and Spanish. It includes linguistic processing and built voices for the aforementioned languages. Its… ☆18 · Jan 15, 2026 · Updated 5 months ago
- Convert Files / Folders / GitHub Repos Into AI / LLM-ready Files ☆164 · Jan 31, 2025 · Updated last year
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories ☆19 · Apr 10, 2025 · Updated last year
- Official PyTorch implementation of (ICME2025 oral) "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-… ☆26 · Feb 1, 2026 · Updated 4 months ago
- This tool will help you build a 3D character rig without building it yourself from scratch. It will save you hours if not days of rigging… ☆27 · Aug 7, 2022 · Updated 3 years ago
- MFLUX-WEBUI using MLX and the FLUX DEV and Schnell models ☆139 · Feb 15, 2026 · Updated 4 months ago
- PortableApps.com Development Toolkit ☆13 · Apr 8, 2016 · Updated 10 years ago
- [EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion ☆41 · Sep 9, 2025 · Updated 9 months ago
- Synchronize SRT timestamps over an existing accurate transcription ☆41 · Nov 11, 2024 · Updated last year
- Download and run local LLMs within your browser ☆25 · Sep 25, 2025 · Updated 8 months ago
- IPA Phonetic dataset lexicon ☆18 · May 26, 2026 · Updated 3 weeks ago
- Local11Labs allows generating high-quality text-to-speech and podcast content using the fast and tiny Kokoro-82M. ☆51 · Jan 13, 2025 · Updated last year
- Turn any common eBook file into an HQ Audiobook with F5-TTS (Easy Install) ☆39 · Apr 6, 2026 · Updated 2 months ago
- Bypass protection for 剑网3 (JX3) ☆11 · Jul 28, 2017 · Updated 8 years ago
- Text-to-Speech Benchmark ☆26 · Apr 2, 2026 · Updated 2 months ago
- Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training. ☆15 · Jun 27, 2023 · Updated 2 years ago
- CaSIL is an advanced natural language processing system that implements a sophisticated four-layer semantic analysis architecture. It pro… ☆67 · Nov 5, 2024 · Updated last year