run-llama / voice-chat-pdfLinks
Use OpenAI's realtime API for a chatting with your documents
☆330Updated last year
Alternatives and similar repositories for voice-chat-pdf
Users that are interested in voice-chat-pdf are comparing it to the libraries listed below
Sorting:
- A NextJS/Langflow based app that takes a PDF and converts it into a podcast.☆227Updated last year
- Realtime API with Firecrawl Tool - Forked from the OpenAI Realtime Console☆160Updated last year
- SearchGPT / Perplexity Pages clone, but personalised for you.☆246Updated last year
- PostBot 3000 is an open-source project that shows how to build a powerful AI agent and stream responses and generate artifacts. This proj…☆290Updated last year
- podcastfy.ai gradio demo app☆334Updated last year
- Voice-Enabled Math Tutor Powered by Groq that Calculates and Renders Live Problems and Instruction with LaTeX in Seconds!☆238Updated last year
- AI Meeting Minutes analysis App built with NextJS, Langflow, Groq, and OpenAI☆491Updated last year
- Build realtime voice and video agents with Google's new Gemini 2.0 (API is free for now)☆319Updated 3 months ago
- An opensource implementation of NotebookLM using Deepseek-V3 and PlayHT TTS.☆294Updated 11 months ago
- Gemini Multimodal Live + WebRTC in a single `app.ts`☆211Updated 2 months ago
- Assistant for voice-to-blog writing☆147Updated 11 months ago
- ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3☆501Updated 4 months ago
- Daily Bots Web Demo showcasing how to build real-time voice AI agents☆249Updated 3 months ago
- Building Blocks for Multi-Modal Gradio Powered by Groq Apps☆115Updated last year
- openperplex is an opensource AI search engine☆171Updated last year
- ☆252Updated 11 months ago
- Turn local files into a prompt for an LLM☆177Updated 11 months ago
- A powerful Python tool that leverages Claude 3.5 Sonnet Vision API to detect and visualize objects in images. The script automatically dr…☆220Updated last year
- Chat Application Starter Kit — Gemini Multimodal Live API + Pipecat☆222Updated 2 months ago
- The AI assistant for computer control.☆326Updated last year
- NotebookLlama powered by Groq - Create podcasts on any topic lightning fast☆78Updated last year
- Chat with any website on your local machine☆85Updated last year
- ☆264Updated last year
- ☆149Updated last year
- A cool AI Diagram generator from a given topic, that streams the partial diagrams from the incomplete JSONs during generation. Built usin…☆216Updated last year
- ☆318Updated last year
- 📋 NotebookMLX - An Open Source version of NotebookLM (Ported NotebookLlama)☆330Updated 9 months ago
- 🎥➡️📝 Hermes: Blazing-fast video transcription powered by AI gods! Transcribe 6.5 minutes of video in just 1 second using Groq's LPU. Ch…☆79Updated last year
- napkins.dev – from screenshot to app☆86Updated last year
- Serverless Modal + FastAPI + React + ColPali + Qdrant + GPT4o Vision RAG (V-RAG) Demo☆402Updated 6 months ago