lamm-mit / PDF2AudioLinks
☆1,274Updated 2 months ago
Alternatives and similar repositories for PDF2Audio
Users that are interested in PDF2Audio are comparing it to the libraries listed below
Sorting:
- Convert any PDF into a podcast episode!☆2,347Updated 6 months ago
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though☆552Updated 3 weeks ago
- Transform PDFs into AI podcasts for engaging on-the-go audio content.☆670Updated 2 weeks ago
- ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3☆491Updated 5 months ago
- Make any LLM to think like OpenAI o1 and deepseek R1☆490Updated 4 months ago
- podcastfy.ai gradio demo app☆334Updated 6 months ago
- Convert any PDF into a podcast episode!☆764Updated 3 months ago
- Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f…☆1,113Updated 2 months ago
- openperplex is an opensource AI search engine☆864Updated 10 months ago
- Prompt optimization scratch☆753Updated 2 months ago
- Use OpenAI's realtime API for a chatting with your documents☆330Updated 8 months ago
- Local realtime voice AI☆2,328Updated 3 months ago
- A Python package that makes it easy for developers to create AI apps powered by various AI providers.☆1,620Updated 2 months ago
- 📋 NotebookMLX - An Open Source version of NotebookLM (Ported NotebookLlama)☆296Updated 3 months ago
- open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for desig…☆921Updated 4 months ago
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL☆1,015Updated last week
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆1,136Updated last month
- Implementation of F5-TTS in MLX☆554Updated 3 months ago
- RAT is a powerful tool that improves AI responses by leveraging DeepSeek's reasoning capabilities to guide other models through a structu…☆626Updated 4 months ago
- Infinite Bookshelf: Generate entire books in seconds using Groq and Llama3☆1,310Updated 5 months ago
- Whisper with Medusa heads☆842Updated 3 weeks ago
- Serverless Modal + FastAPI + React + ColPali + Qdrant + GPT4o Vision RAG (V-RAG) Demo☆378Updated 7 months ago
- An Open Source implementation of Notebook LM with more flexibility and features☆1,898Updated last week
- Examples for Cerebrium Serverless GPUs☆489Updated last week
- This repository contains the code for a virtual try-on application built using Flask, Twilio's WhatsApp API, and Gradio's virtual try-on …☆344Updated 8 months ago
- 📲 An agent for sourcing, curating, and scheduling social media posts with human-in-the-loop.☆1,284Updated last month
- AI reads books: Page-by-Page PDF Knowledge Extractor & Summarizer. script performs an intelligent page-by-page analysis of PDF books, met…☆1,491Updated 5 months ago
- SearchGPT / Perplexity Pages clone, but personalised for you.☆242Updated 9 months ago
- Generate accurate transcripts using Apple's MLX framework☆410Updated last month
- Real Time Speech Transcription with FastRTC ⚡️and Local Whisper 🤗☆660Updated this week