mateogon / pdf-narrator
Convert your PDFs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient processing for low-resource systems.
☆43Updated last week
Alternatives and similar repositories for pdf-narrator:
Users that are interested in pdf-narrator are comparing it to the libraries listed below
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆39Updated 4 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆49Updated 2 months ago
- Choose a topic, a music genre and wait for the agents to generate a song☆52Updated 7 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆57Updated last week
- [WIP] AI Try-On plugin for Chrome☆27Updated 11 months ago
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆29Updated this week
- High level tool use for LLMs☆34Updated 6 months ago
- Automated LLM novelist☆42Updated 10 months ago
- Local11Labs allows generating high-quality text-to-speech and podcast content using the fast and tiny Kokoro-82M.☆44Updated last month
- Create text chunks which end at natural stopping points without using a tokenizer☆26Updated 2 months ago
- ☆43Updated 3 months ago
- 🍳 AyaMCooking is a Voice-to-Voice Mutli-lingual RAG Agent that makes a perfect sous chef for your kitchen, in upto 10 Languages 🤌🧑🍳☆21Updated 3 months ago
- Cog wrapper for collabora/WhisperSpeech☆25Updated 11 months ago
- ☆18Updated 5 months ago
- kokoro text to speech using javascript☆52Updated 3 weeks ago
- A discord bot to stay up to date with Hugging Face Daily Papers.☆15Updated 10 months ago
- A lightweight Python library for running TTS models with a unified API.☆16Updated last month
- Generate visual podcasts about novels using open source models☆25Updated 2 years ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆17Updated 4 months ago
- B-Llama3o a llama3 with Vision Audio and Audio understanding as well as text and Audio and Animation Data output.☆26Updated 8 months ago
- An API for VoiceCraft.☆26Updated 7 months ago
- idea: https://github.com/nyxkrage/ebook-groupchat/☆85Updated 6 months ago
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.☆56Updated 4 months ago
- ☆22Updated 3 months ago
- Experience the power of AI with this free AI voice generator demo. Utilizing Deepgram and Groq, we transform text into voice seamlessly. …☆37Updated 8 months ago
- ☆12Updated last year
- Run Ollama LLM models in Google Colab for free☆32Updated 2 months ago
- Auto-Video maker handling many AI's☆10Updated 11 months ago
- AI-augmented, conversational information retrieval and data exploration☆39Updated 11 months ago