mateogon / pdf-narrator
Convert your PDFs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient processing for low-resource systems.
☆21Updated this week
Alternatives and similar repositories for pdf-narrator:
Users that are interested in pdf-narrator are comparing it to the libraries listed below
- ☆40Updated 2 months ago
- High level tool use for LLMs☆34Updated 5 months ago
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files☆31Updated this week
- The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead)☆21Updated 3 months ago
- ☆29Updated last year
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆28Updated this week
- ☆18Updated 2 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆21Updated last month
- ☆21Updated 5 months ago
- 🍳 AyaMCooking is a Voice-to-Voice Mutli-lingual RAG Agent that makes a perfect sous chef for your kitchen, in upto 10 Languages 🤌🧑🍳☆21Updated 2 months ago
- Your Python AI Coder!☆31Updated 2 weeks ago
- OpenAI GPT hosted Agent Framework for Windows and MacOS☆36Updated 6 months ago
- Gradio based tool to run opensource LLM models directly from Huggingface☆90Updated 6 months ago
- ☆26Updated 3 months ago
- tiny_fnc_engine is a minimal python library that provides a flexible engine for calling functions extracted from a LLM.☆38Updated 3 months ago
- Uses a Gradio interface to stream coding related responses from local and cloud based large language models. Pulls context from GitHub Re…☆16Updated 3 months ago
- Cog wrapper for collabora/WhisperSpeech☆25Updated 10 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆29Updated 5 months ago
- Run Ollama LLM models in Google Colab for free☆29Updated last month
- ☆18Updated 4 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆62Updated 2 months ago
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.☆23Updated 4 months ago
- B-Llama3o a llama3 with Vision Audio and Audio understanding as well as text and Audio and Animation Data output.☆26Updated 7 months ago
- ☆18Updated last week
- ☆16Updated 3 weeks ago
- 100% Local Document deep search with LLMs☆25Updated 4 months ago
- Modified Beam Search with periodical restart☆12Updated 3 months ago
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆40Updated last month
- run ollama & gguf easily with a single command☆49Updated 7 months ago
- SoftWhisper simplifies audio and video transcription using the powerful Whisper model. Easily select custom models, languages, and tasks,…☆40Updated 3 months ago