lamm-mit / PDF2AudioLinks
☆1,341Updated 7 months ago
Alternatives and similar repositories for PDF2Audio
Users that are interested in PDF2Audio are comparing it to the libraries listed below
Sorting:
- Convert any PDF into a podcast episode!☆2,505Updated 11 months ago
- Convert any PDF into a podcast episode!☆807Updated 8 months ago
- Infinite Bookshelf: Generate entire books in seconds using Groq and Llama3☆1,349Updated last month
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though☆560Updated last month
- Transform PDFs into AI podcasts for engaging on-the-go audio content.☆767Updated 5 months ago
- podcastfy.ai gradio demo app☆334Updated 11 months ago
- ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3☆500Updated 3 months ago
- AI reads books: Page-by-Page PDF Knowledge Extractor & Summarizer. script performs an intelligent page-by-page analysis of PDF books, met…☆1,558Updated 10 months ago
- Use OpenAI's realtime API for a chatting with your documents☆330Updated last year
- Company Researcher tool helps you instantly understand any company inside out.☆1,319Updated 3 months ago
- ⚡ Insanely fast AI voice assistant with <500ms response times☆577Updated 11 months ago
- A Python package that makes it easy for developers to create AI apps powered by various AI providers.☆1,650Updated 7 months ago
- openperplex is an opensource AI search engine☆884Updated last year
- A very quick project that transforms research papers into engaging three-person discussions, offering an intuitive and thought-provoking …☆601Updated 11 months ago
- This repository contains the code for a virtual try-on application built using Flask, Twilio's WhatsApp API, and Gradio's virtual try-on …☆348Updated last year
- Real Time Speech Transcription with FastRTC ⚡️and Local Whisper 🤗☆691Updated 4 months ago
- Local realtime voice AI☆2,378Updated 8 months ago
- Examples for Cerebrium Serverless GPUs☆512Updated 3 weeks ago
- An autoagentic AGI that is self-evolving and modular.☆964Updated last year
- napkins.dev – from screenshot to app☆1,421Updated 7 months ago
- Detect and extract tables to markdown and csv☆755Updated 9 months ago
- first base model for full-duplex conversational audio☆1,767Updated 10 months ago
- Sample apps to help developers get started with Structured Outputs☆662Updated 10 months ago
- Make any LLM to think like OpenAI o1 and deepseek R1☆492Updated 9 months ago
- Voice-Enabled Math Tutor Powered by Groq that Calculates and Renders Live Problems and Instruction with LaTeX in Seconds!☆236Updated 10 months ago
- Whisper with Medusa heads☆864Updated 3 months ago
- An AI personal tutor built with Llama 3.1☆1,952Updated 5 months ago
- AI video agents framework for next-gen video interactions and workflows.☆1,132Updated 3 months ago
- Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f…☆1,325Updated 7 months ago
- AI Meeting Minutes analysis App built with NextJS, Langflow, Groq, and OpenAI☆488Updated 10 months ago