Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient processing for low-resource systems.
☆168Feb 26, 2026Updated last month
Alternatives and similar repositories for pdf-narrator
Users that are interested in pdf-narrator are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Transform your PDFs into captivating audio podcasts with this PDF-to-Podcast pipeline! Combining advanced language models and high-qualit…☆17Nov 11, 2024Updated last year
- A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats includ…☆1,331Dec 15, 2025Updated 3 months ago
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion☆10Sep 22, 2024Updated last year
- Automatically convert epubs to audiobooks☆259Mar 8, 2025Updated last year
- StyleTTS 2 Optimized Training Fork☆33Feb 2, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code repository for TIDMAD: Time series Dataset for Discovering Dark Matter with AI Denoising.☆15Mar 4, 2026Updated 3 weeks ago
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"☆38Oct 28, 2025Updated 5 months ago
- OmniByteFormer is a generalized Transformer model that can process any type of data by converting it into byte sequences, bypassing tradi…☆15Updated this week
- A self-hosted version of WaterCrawl, a powerful web crawling and data extraction platform.☆13Jul 27, 2025Updated 8 months ago
- The EveryVoice TTS Toolkit - Text To Speech for your language☆43Mar 19, 2026Updated last week
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆10Sep 30, 2024Updated last year
- Top GitHub Repositories for AI and ML Enthusiasts☆10Jan 1, 2025Updated last year
- Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching☆4,623Jan 4, 2026Updated 2 months ago
- 🔥🔥 Kokoro in Rust. https://huggingface.co/hexgrad/Kokoro-82M Insanely fast, realtime TTS with high quality you ever have.☆738Mar 11, 2026Updated 2 weeks ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- Little utilities for Aegisub that make my life easier☆14Aug 6, 2025Updated 7 months ago
- Forced alignment decoder for Whisper.☆15Mar 13, 2024Updated 2 years ago
- Project of Singing Voice Conversion.☆16Oct 27, 2023Updated 2 years ago
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986☆49Jan 19, 2026Updated 2 months ago
- TTS with kokoro and onnx runtime☆2,437Jan 30, 2026Updated 2 months ago
- ☆15Aug 22, 2025Updated 7 months ago
- Scaled Uniform Noise for Ancestral & Stochastic samplers and Noisy latent image☆17Mar 30, 2025Updated 11 months ago
- Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its…☆18Jan 15, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11May 4, 2022Updated 3 years ago
- Convert Files / Folders / GitHub Repos Into AI / LLM-ready Files☆162Jan 31, 2025Updated last year
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA]☆105Nov 19, 2025Updated 4 months ago
- ☆12Jul 25, 2020Updated 5 years ago
- Demo repository for creating a custom chatbot powered by LLMs for Telegram and Whatsapp.☆15Jan 18, 2024Updated 2 years ago
- various experiments for scaling inference time compute with small reasoning models☆17Jan 16, 2025Updated last year
- Official PyTorch implementation of (ICME2025 oral) "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-…☆24Feb 1, 2026Updated last month
- This tool will help you build a 3D character rig without building it yourself from scratch. It will save you hours if not days of rigging…☆27Aug 7, 2022Updated 3 years ago
- PortableApps.com Development Toolkit☆13Apr 8, 2016Updated 9 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Aug 1, 2025Updated 7 months ago
- Synchronize SRT timestamps over an existing accurate transcription☆41Nov 11, 2024Updated last year
- IPA Phonetic dataset lexicon☆18Mar 20, 2026Updated last week
- Local11Labs allows generating high-quality text-to-speech and podcast content using the fast and tiny Kokoro-82M.☆52Jan 13, 2025Updated last year
- Text-to-Speech Benchmark☆23Updated this week
- 通过HTML模版的样式,生成游戏卡牌的插件,这仅仅是一个方案可行性的测试流程☆27Jun 14, 2024Updated last year
- Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer"☆44Oct 30, 2025Updated 5 months ago