itsanuragkumarjha / Voice-chat-enabled-RAG-chatbot-with-real-time-internet-access
An open-source project that uses cutting-edge NLP models and real-time web search to provide dynamic voice query responses. Features include speech-to-text with Nemo, text generation with Mistral-7B, DuckDuckGo search integration, and text-to-speech with edge-tts, all in a user-friendly Gradio interface.
☆12Updated 7 months ago
Alternatives and similar repositories for Voice-chat-enabled-RAG-chatbot-with-real-time-internet-access:
Users that are interested in Voice-chat-enabled-RAG-chatbot-with-real-time-internet-access are comparing it to the libraries listed below
- Real time audio to audio translation over sockets. With virtual microphones, you can use this in any video conferencing software you'd li…☆25Updated 5 months ago
- Simulates talk with an AI that can express emotions☆43Updated 5 months ago
- AI Voice Assistant: talk to an AI agent that handles event scheduling, managing contacts, accessing your knowledge base and web searching…☆23Updated last month
- Agent Studio is an AI agent application designed to handle real-time interactions through phone calls, web-based voice user interfaces (V…☆25Updated 2 months ago
- Multimodal AI App using Llava 7B and Gradio.☆38Updated 8 months ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆37Updated 3 months ago
- 🎙️ Speak with AI - Run locally using Ollama, OpenAI or xAI - Speech uses XTTS, OpenAI or ElevenLabs☆132Updated this week
- AI Lip Syncing application, deployed on Streamlit☆35Updated 10 months ago
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run; if…☆21Updated this week
- Voice Assistant based on Whisper ASR and ChatGPT API☆63Updated last year
- Co-create a PowerPoint presentation with Generative AI☆93Updated 2 weeks ago
- Use Text to SQL to analyze US Government contract data☆16Updated 3 weeks ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆119Updated 7 months ago
- Using Whisper, elevenlabs, PaLM, and Twilio to create a virtual phone assistant☆16Updated last year
- Convert your PDFs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient…☆33Updated last week
- This is a sample example repo on how to extend Vapi functionalities and deploy it on Vercel Edge Functions.☆17Updated 6 months ago
- Self-hosted AI voice agent☆77Updated 4 months ago
- An JS web client for connecting to Pipecat bots with voice and vision☆42Updated 3 weeks ago
- Outbound Phone GPT is a sophisticated prototype for a context-aware agent designed to autonomously handle outbound phone calls.☆12Updated 9 months ago
- Okra, your all in one personal AI assistant☆14Updated 7 months ago
- Agentic Chat App is an advanced AI-powered chat application designed for seamless real-time communication and intelligent responses. Buil…☆57Updated 6 months ago
- Retrieval Augmented Generation-based Agentic CrewAI☆23Updated 2 months ago
- Generate video stories with AI ✨☆29Updated 4 months ago
- Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip.☆15Updated 11 months ago
- Open Server is an OpenAI API Compatible Server for generating text, images, embeddings, and storing them in vector databases. It also inc…☆15Updated last year
- WIP exploration using Twilio Media Streams and Generative AI☆38Updated 11 months ago
- Harness the power of NVIDIA technologies and LangChain to create dynamic avatars from live speech, integrating RIVA ASR and TTS with Audi…☆55Updated 6 months ago
- [WIP] AI Try-On plugin for Chrome☆26Updated 10 months ago
- Example agents I've built using the LiveKit Agents (https://github.com/livekit/agents) framework☆18Updated 8 months ago
- An open-source, browser-based transcript viewer and manager. Upload, transcribe, and chat with meeting recordings using AI. Features meet…☆27Updated 2 months ago