lamm-mit / PDF2AudioLinks
☆1,347Updated 7 months ago
Alternatives and similar repositories for PDF2Audio
Users that are interested in PDF2Audio are comparing it to the libraries listed below
Sorting:
- Infinite Bookshelf: Generate entire books in seconds using Groq and Llama3☆1,349Updated 2 months ago
- Convert any PDF into a podcast episode!☆2,523Updated last year
- Convert any PDF into a podcast episode!☆809Updated 8 months ago
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though☆563Updated 3 weeks ago
- Transform PDFs into AI podcasts for engaging on-the-go audio content.☆774Updated 6 months ago
- ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3☆502Updated 4 months ago
- A very quick project that transforms research papers into engaging three-person discussions, offering an intuitive and thought-provoking …☆602Updated last year
- Use OpenAI's realtime API for a chatting with your documents☆330Updated last year
- openperplex is an opensource AI search engine☆886Updated last year
- podcastfy.ai gradio demo app☆334Updated last year
- AI reads books: Page-by-Page PDF Knowledge Extractor & Summarizer. script performs an intelligent page-by-page analysis of PDF books, met…☆1,564Updated 10 months ago
- An AI personal tutor built with Llama 3.1☆1,953Updated last week
- AI-powered assistant to help you with your daily tasks, powered by Llama 3, DeepSeek R1, and many more models on HuggingFace.☆522Updated last week
- A Python package that makes it easy for developers to create AI apps powered by various AI providers.☆1,651Updated 8 months ago
- An opensource implementation of NotebookLM using Deepseek-V3 and PlayHT TTS.☆292Updated 11 months ago
- Voice-Enabled Math Tutor Powered by Groq that Calculates and Renders Live Problems and Instruction with LaTeX in Seconds!☆236Updated 11 months ago
- Make any LLM to think like OpenAI o1 and deepseek R1☆492Updated 10 months ago
- An experimental UI for text-to-knowledge-graph generation☆780Updated last year
- An automated AI system (Python framework) designed to analyze any type of website content and generate structured reports using Claude 3.…☆670Updated last year
- This repository contains the code for a virtual try-on application built using Flask, Twilio's WhatsApp API, and Gradio's virtual try-on …☆348Updated last year
- An autoagentic AGI that is self-evolving and modular.☆962Updated last year
- Real Time Speech Transcription with FastRTC ⚡️and Local Whisper 🤗☆691Updated 5 months ago
- Sample apps to help developers get started with Structured Outputs☆660Updated 11 months ago
- ⚡ Insanely fast AI voice assistant with <500ms response times☆578Updated last year
- Local SRT/LLM/TTS Voicechat☆744Updated last year
- Prompt optimization scratch☆875Updated 7 months ago
- Company Researcher tool helps you instantly understand any company inside out.☆1,328Updated 3 months ago
- AI video agents framework for next-gen video interactions and workflows.☆1,152Updated 4 months ago
- Detect and extract tables to markdown and csv☆755Updated 10 months ago
- Build realtime voice and video agents with Google's new Gemini 2.0 (API is free for now)☆318Updated 3 months ago