lamm-mit / PDF2AudioLinks

☆1,291

Alternatives and similar repositories for PDF2Audio

Users that are interested in PDF2Audio are comparing it to the libraries listed below

Sorting:

gabrielchua / open-notebooklm
Convert any PDF into a podcast episode!
☆2,416Updated 7 months ago
knowsuchagency / pdf-to-podcast
Convert any PDF into a podcast episode!
☆789Updated 4 months ago
Azzedde / paper_to_podcast
A very quick project that transforms research papers into engaging three-person discussions, offering an intuitive and thought-provoking …
☆581Updated 7 months ago
NVIDIA-AI-Blueprints / pdf-to-podcast
Transform PDFs into AI podcasts for engaging on-the-go audio content.
☆724Updated 2 months ago
Bklieger / infinite-bookshelf
Infinite Bookshelf: Generate entire books in seconds using Groq and Llama3
☆1,319Updated 7 months ago
souzatharsis / podcastfy-demo
podcastfy.ai gradio demo app
☆335Updated 8 months ago
SouthBridgeAI / offmute
An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though
☆556Updated 2 months ago
Bklieger / ScribeWizard
ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3
☆494Updated 6 months ago
echohive42 / AI-reads-books-page-by-page
AI reads books: Page-by-Page PDF Knowledge Extractor & Summarizer. script performs an intelligent page-by-page analysis of PDF books, met…
☆1,505Updated 6 months ago
YassKhazzan / openperplex_backend_os
openperplex is an opensource AI search engine
☆869Updated last year
sofi444 / realtime-transcription-fastrtc
Real Time Speech Transcription with FastRTC ⚡️and Local Whisper 🤗
☆671Updated 3 weeks ago
aiola-lab / whisper-medusa
Whisper with Medusa heads
☆850Updated 3 weeks ago
Standard-Intelligence / hertz-dev
first base model for full-duplex conversational audio
☆1,748Updated 7 months ago
run-llama / voice-chat-pdf
Use OpenAI's realtime API for a chatting with your documents
☆331Updated 10 months ago
adarshb3 / Virtual-Try-On-Application-using-Flask-Twilio-and-Gradio
This repository contains the code for a virtual try-on application built using Flask, Twilio's WhatsApp API, and Gradio's virtual try-on …
☆346Updated 9 months ago
reflex-dev / reflex-llm-examples
☆826Updated 2 months ago
misbahsy / meetingmind
AI Meeting Minutes analysis App built with NextJS, Langflow, Groq, and OpenAI
☆466Updated 7 months ago
CerebriumAI / examples
Examples for Cerebrium Serverless GPUs
☆508Updated this week
langchain-ai / social-media-agent
📲 An agent for sourcing, curating, and scheduling social media posts with human-in-the-loop.
☆1,394Updated this week
Nutlope / llamatutor
An AI personal tutor built with Llama 3.1
☆1,898Updated 2 months ago
dsa / fast-voice-assistant
⚡ Insanely fast AI voice assistant with <500ms response times
☆412Updated 8 months ago
satvik314 / opensource_notebooklm
An opensource implementation of NotebookLM using Deepseek-V3 and PlayHT TTS.
☆284Updated 7 months ago
astramind-ai / Auralis
A Fast TTS Engine
☆529Updated 6 months ago
hinthornw / promptimizer
Prompt optimization scratch
☆783Updated 3 months ago
openai / openai-structured-outputs-samples
Sample apps to help developers get started with Structured Outputs
☆649Updated 6 months ago
bklieger-groq / mathtutor-on-groq
Voice-Enabled Math Tutor Powered by Groq that Calculates and Renders Live Problems and Instruction with LaTeX in Seconds!
☆231Updated 7 months ago
lhl / voicechat2
Local SRT/LLM/TTS Voicechat
☆702Updated 9 months ago
pipecat-ai / rtvi-web-demo
Example UI implementing the RTVI web client
☆477Updated 8 months ago
kyutai-labs / hibiki
Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f…
☆1,252Updated 3 months ago
superlinear-ai / raglite
🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL
☆1,043Updated last month