Use OpenAI's realtime API for a chatting with your documents
☆328Oct 6, 2024Updated last year
Alternatives and similar repositories for voice-chat-pdf
Users that are interested in voice-chat-pdf are comparing it to the libraries listed below
Sorting:
- Realtime API with Firecrawl Tool - Forked from the OpenAI Realtime Console☆163Jan 7, 2026Updated 2 months ago
- AI Meeting Minutes analysis App built with NextJS, Langflow, Groq, and OpenAI☆494Dec 25, 2024Updated last year
- ☆169Oct 31, 2024Updated last year
- React app for inspecting, building and debugging with the Realtime API☆3,563Aug 28, 2025Updated 6 months ago
- A simple client and utils for interacting with OpenAI's Realtime API in Python☆244May 15, 2025Updated 9 months ago
- ☆1,366Apr 18, 2025Updated 10 months ago
- A NextJS/Langflow based app that takes a PDF and converts it into a podcast.☆236Oct 24, 2024Updated last year
- ☆602Oct 26, 2024Updated last year
- A Gradio app that transcribes YouTube videos using audio extraction and OpenAI’s Whisper model.☆360Oct 7, 2024Updated last year
- ☆74Sep 27, 2024Updated last year
- Structured information extraction from documents☆318Sep 26, 2024Updated last year
- podcastfy.ai gradio demo app☆334Nov 30, 2024Updated last year
- Open-source Next.js template for building apps that are fully generated by AI. By E2B.☆6,194Feb 28, 2026Updated last week
- An AI personal tutor built with Llama 3.1☆1,991Feb 27, 2026Updated last week
- Use OpenAI's realtime API for a chatting with your documents☆248Jan 15, 2025Updated last year
- 📃 A better UX for chat, writing content, and coding with LLMs.☆5,389Feb 25, 2026Updated last week
- Play with OpenAI's new Realtime API in your browser☆338Sep 15, 2025Updated 5 months ago
- Infinite Bookshelf: Generate entire books in seconds using Groq and Llama3☆1,354Oct 9, 2025Updated 4 months ago
- Open source inference code for Rev's model☆435Apr 22, 2025Updated 10 months ago
- The easiest way to get started with LlamaIndex☆1,477Jul 16, 2025Updated 7 months ago
- Datalore is an AI-powered Data Analysis tool that integrates Anthropic's Claude API with various data analysis libraries and custom funct…☆42Feb 24, 2025Updated last year
- Developer showcase of projects built on Cartesia☆20Aug 28, 2024Updated last year
- The easiest way to use Agentic RAG in any enterprise☆4,405Jan 22, 2025Updated last year
- Generate accurate transcripts using Apple's MLX framework☆451Apr 26, 2025Updated 10 months ago
- An opensource implementation of NotebookLM using Deepseek-V3 and PlayHT TTS.☆302Jan 3, 2025Updated last year
- ComfyUI custom nodes for Luma AI Dream Machine API☆207Mar 31, 2025Updated 11 months ago
- o1-engineer is a command-line tool designed to assist developers in managing and interacting with their projects efficiently. Leveraging …☆2,863Dec 16, 2024Updated last year
- A realtime AI image generator☆1,019Dec 15, 2025Updated 2 months ago
- Open source Claude Artifacts – built with Llama 3.1 405B☆6,886Updated this week
- Leverage the OpenAI Realtime API (12-17-2024) with this Next.js 15 starter template featuring shadcn/ui components, tool-calling & locali…☆445Apr 12, 2025Updated 10 months ago
- Example backend infrastructure for launching new bots that connect to your RTVI clients.☆19Oct 10, 2024Updated last year
- ☆23Oct 19, 2024Updated last year
- Serverless Modal + FastAPI + React + ColPali + Qdrant + GPT4o Vision RAG (V-RAG) Demo☆405Jun 26, 2025Updated 8 months ago
- Open Source framework for voice and multimodal conversational AI☆10,529Updated this week
- OpenAI Realtime API Voice Agent with RAG, Function Calling, and Caller History☆127Oct 14, 2024Updated last year
- ☆122Oct 7, 2024Updated last year
- Perplexity Inspired Answer Engine☆5,016Jun 27, 2025Updated 8 months ago
- ☆20Oct 24, 2025Updated 4 months ago
- ⚡ Insanely fast AI voice assistant with <500ms response times☆584Dec 3, 2024Updated last year