run-llama / voice-chat-pdf
Use OpenAI's realtime API for chatting with your documents
☆317 Updated 4 months ago
Alternatives and similar repositories for voice-chat-pdf:
Users who are interested in voice-chat-pdf are comparing it to the libraries listed below.
- PostBot 3000 is an open-source project that shows how to build a powerful AI agent and stream responses and generate artifacts. This proj… ☆285 Updated 3 months ago
- AI Meeting Minutes analysis App built with NextJS, Langflow, Groq, and OpenAI ☆418 Updated 2 months ago
- A NextJS/Langflow based app that takes a PDF and converts it into a podcast. ☆210 Updated 4 months ago
- Realtime API with Firecrawl Tool - Forked from the OpenAI Realtime Console ☆153 Updated 4 months ago
- Assistant for voice-to-blog writing ☆129 Updated last month
- ☆274 Updated 2 weeks ago
- podcastfy.ai gradio demo app ☆329 Updated 3 months ago
- Gemini Multimodal Live + WebRTC in a single `app.ts` ☆185 Updated 2 months ago
- An open-source implementation of NotebookLM using Deepseek-V3 and PlayHT TTS. ☆242 Updated 2 months ago
- New user experiences for interacting with agents ☆429 Updated this week
- Building Blocks for Multi-Modal Gradio Powered by Groq Apps ☆102 Updated 3 months ago
- Build realtime voice and video agents with Google's new Gemini 2.0 (API is free for now) ☆261 Updated 3 weeks ago
- ☆122 Updated 3 months ago
- ☆187 Updated last month
- A Chrome extension for asking questions over websites ☆306 Updated 3 weeks ago
- SearchGPT / Perplexity Pages clone, but personalised for you. ☆235 Updated 6 months ago
- openperplex is an open-source AI search engine ☆164 Updated 7 months ago
- This repository hosts a suite of specialized agents designed to power your brainstorming sessions. Each agent brings a unique perspective… ☆288 Updated 3 months ago
- Voice-Enabled Math Tutor Powered by Groq that Calculates and Renders Live Problems and Instruction with LaTeX in Seconds! ☆215 Updated 2 months ago
- Turn local files into a prompt for an LLM ☆165 Updated last month
- ☆347 Updated 2 months ago
- Daily Bots Web Demo showcasing how to build real-time voice AI agents ☆209 Updated 4 months ago
- AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more ☆287 Updated 7 months ago
- ☆199 Updated 4 months ago
- Serverless Modal + FastAPI + React + ColPali + Qdrant + GPT4o Vision RAG (V-RAG) Demo ☆347 Updated 3 months ago
- Chat Application Starter Kit — Gemini Multimodal Live API + Pipecat ☆173 Updated 2 months ago