run-llama / voice-chat-pdf
Use OpenAI's realtime API for a chatting with your documents
☆309Updated 3 months ago
Alternatives and similar repositories for voice-chat-pdf:
Users that are interested in voice-chat-pdf are comparing it to the libraries listed below
- ☆217Updated last month
- AI Meeting Minutes analysis App built with NextJS, Langflow, Groq, and OpenAI☆404Updated last month
- Realtime API with Firecrawl Tool - Forked from the OpenAI Realtime Console☆151Updated 3 months ago
- An opensource implementation of NotebookLM using Deepseek-V3 and PlayHT TTS.☆209Updated 3 weeks ago
- SearchGPT / Perplexity Pages clone, but personalised for you.☆231Updated 4 months ago
- podcastfy.ai gradio demo app☆325Updated last month
- New user experiences for interacting with agents☆376Updated this week
- A NextJS/Langflow based app that takes a PDF and converts it into a podcast.☆205Updated 3 months ago
- PostBot 3000 is an open-source project that shows how to build a powerful AI agent and stream responses and generate artifacts. This proj…☆285Updated 2 months ago
- Voice-Enabled Math Tutor Powered by Groq that Calculates and Renders Live Problems and Instruction with LaTeX in Seconds!☆210Updated last month
- Assistant for voice-to-blog writing☆121Updated this week
- mind map generator☆67Updated last month
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though☆376Updated 2 months ago
- Gemini Multimodal Live + WebRTC in a single `app.ts`☆175Updated last month
- Turn local files into a prompt for an LLM☆160Updated last week
- Daily Bots Web Demo showcasing how to build real-time voice AI agents☆194Updated 3 months ago
- Building Blocks for Multi-Modal Gradio Powered by Groq Apps☆102Updated 2 months ago
- Build realtime voice and video agents with Google's new Gemini 2.0 (API is free for now)☆229Updated last month
- ☆167Updated 2 weeks ago
- Open source alternative to Gemini Deep Research. Generate reports with AI based on search results.☆245Updated last week
- ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3☆472Updated last week
- Chat Application Starter Kit — Gemini Multimodal Live API + Pipecat☆159Updated last month
- Leverage the OpenAI Realtime API (12-17-2024) with this Next.js 15 starter template featuring shadcn/ui components, tool-calling & locali…☆191Updated this week
- ☆178Updated 2 months ago
- NotebookLlama powered by Groq - Create podcasts on any topic lightning fast☆66Updated 3 months ago
- ☆84Updated last week
- A powerful Python tool for performing technical searches using the Perplexity API, optimized for retrieving precise facts, code examples,…☆189Updated 2 weeks ago
- This repository hosts a suite of specialized agents designed to power your brainstorming sessions. Each agent brings a unique perspective…☆281Updated 2 months ago
- ☆120Updated 6 months ago
- 🎥➡️📝 Hermes: Blazing-fast video transcription powered by AI gods! Transcribe 6.5 minutes of video in just 1 second using Groq's LPU. Ch…☆72Updated 4 months ago