lamm-mit / PDF2Audio
☆1,254Updated 2 weeks ago
Alternatives and similar repositories for PDF2Audio:
Users that are interested in PDF2Audio are comparing it to the libraries listed below
- Convert any PDF into a podcast episode!☆741Updated last month
- Infinite Bookshelf: Generate entire books in seconds using Groq and Llama3☆1,296Updated 4 months ago
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though☆544Updated last month
- Convert any PDF into a podcast episode!☆2,257Updated 5 months ago
- openperplex is an opensource AI search engine☆855Updated 9 months ago
- ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3☆487Updated 3 months ago
- A modular voice assistant application for experimenting with state-of-the-art transcription, response generation, and text-to-speech mode…☆973Updated 2 weeks ago
- podcastfy.ai gradio demo app☆330Updated 5 months ago
- Use OpenAI's realtime API for a chatting with your documents☆328Updated 7 months ago
- Streamlines and simplifies prompt design for both developers and non-technical users with a low code approach.☆1,055Updated last month
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with PostgreSQL or SQLite☆926Updated this week
- List of software that allows searching the web with the assistance of AI: https://hf.co/spaces/felladrin/awesome-ai-web-search☆837Updated this week
- An Open Source implementation of Notebook LM with more flexibility and features☆1,484Updated last week
- Prompt optimization scratch☆724Updated 3 weeks ago
- SearchGPT / Perplexity clone, but personalised for you.☆1,126Updated 9 months ago
- Local realtime voice AI☆2,287Updated 2 months ago
- Sample apps to help developers get started with Structured Outputs☆634Updated 3 months ago
- first base model for full-duplex conversational audio☆1,737Updated 4 months ago
- A Gradio app that transcribes YouTube videos using audio extraction and OpenAI’s Whisper model.☆355Updated 7 months ago
- AI computer use powered by open source LLMs and E2B Desktop Sandbox☆1,108Updated last month
- Serverless Modal + FastAPI + React + ColPali + Qdrant + GPT4o Vision RAG (V-RAG) Demo☆362Updated 5 months ago
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆900Updated last week
- An autoagentic AGI that is self-evolving and modular.☆944Updated 8 months ago
- An experimental UI for text-to-knowledge-graph generation☆770Updated last year
- ☆1,218Updated 6 months ago
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,143Updated this week
- 📲 An agent for sourcing, curating, and scheduling social media posts with human-in-the-loop.☆1,188Updated this week
- Transform PDFs into AI podcasts for engaging on-the-go audio content.☆639Updated 3 weeks ago
- ☆783Updated last year
- AI-powered assistant to help you with your daily tasks, powered by Llama 3, DeepSeek R1, and many more models on HuggingFace.☆499Updated 2 months ago