lamm-mit / PDF2Audio
☆1,206Updated 6 months ago
Alternatives and similar repositories for PDF2Audio:
Users that are interested in PDF2Audio are comparing it to the libraries listed below
- Convert any PDF into a podcast episode!☆2,163Updated 3 months ago
- Convert any PDF into a podcast episode!☆705Updated last week
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though☆532Updated last month
- ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3☆483Updated 2 months ago
- Sample apps to help developers get started with Structured Outputs☆621Updated 2 months ago
- AI reads books: Page-by-Page PDF Knowledge Extractor & Summarizer. script performs an intelligent page-by-page analysis of PDF books, met…☆1,434Updated 2 months ago
- An Open Source implementation of Notebook LM with more flexibility and features☆1,211Updated 2 weeks ago
- Use OpenAI's realtime API for a chatting with your documents☆320Updated 5 months ago
- A Fast TTS Engine☆471Updated 2 months ago
- Infinite Bookshelf: Generate entire books in seconds using Groq and Llama3☆1,288Updated 3 months ago
- podcastfy.ai gradio demo app☆330Updated 3 months ago
- Transform PDFs into AI podcasts for engaging on-the-go audio content.☆599Updated this week
- RAT is a powerful tool that improves AI responses by leveraging DeepSeek's reasoning capabilities to guide other models through a structu…☆608Updated 2 months ago
- Prompt optimization scratch☆672Updated 3 weeks ago
- An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Co…☆3,433Updated last month
- Real Time Speech Transcription with FastRTC ⚡️and Local Whisper 🤗☆538Updated last week
- Generate accurate transcripts using Apple's MLX framework☆386Updated this week
- openperplex is an opensource AI search engine☆846Updated 7 months ago
- ☆780Updated last year
- An automated AI system (Python framework) designed to analyze any type of website content and generate structured reports using Claude 3.…☆623Updated 4 months ago
- Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f…☆926Updated last month
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with PostgreSQL or SQLite☆870Updated last week
- the simplest self-building coding agent☆962Updated 5 months ago
- An opensource implementation of NotebookLM using Deepseek-V3 and PlayHT TTS.☆248Updated 2 months ago
- A list of software that allows searching the web with the assistance of AI: https://hf.co/spaces/felladrin/awesome-ai-web-search☆776Updated 3 weeks ago
- AI Meeting Minutes analysis App built with NextJS, Langflow, Groq, and OpenAI☆431Updated 3 months ago
- 📲 An agent for sourcing, curating, and scheduling social media posts with human-in-the-loop.☆1,103Updated 3 weeks ago
- Portable KMS (knowledge management system) designed to integrate seamlessly with any Retrieval-Augmented Generation (RAG) system☆1,137Updated this week
- Witsy: desktop AI assistant☆792Updated this week
- Implementation of F5-TTS in MLX☆509Updated last week