itsanuragkumarjha / Voice-chat-enabled-RAG-chatbot-with-real-time-internet-access
An open-source project that combines NLP models with real-time web search to answer spoken queries. Features include speech-to-text with NeMo, text generation with Mistral-7B, DuckDuckGo search integration, and text-to-speech with edge-tts, all in a Gradio interface.
☆18 · Updated last year
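The description above implies a four-stage flow: transcribe the audio, search the web, generate a reply from the retrieved context, and synthesize speech. The following is a minimal sketch of how such stages might be chained, not the repository's actual code. All names here (`VoiceRagPipeline`, `build_prompt`, and the `fake_*` stand-ins) are hypothetical; in a real setup the stand-ins would presumably be replaced by calls into NeMo, DuckDuckGo search, Mistral-7B, and edge-tts.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class VoiceRagPipeline:
    """Hypothetical wiring of the four stages named in the description."""
    transcribe: Callable[[bytes], str]    # speech-to-text stage
    search: Callable[[str], List[str]]    # web search stage, returns text snippets
    generate: Callable[[str], str]        # LLM text generation stage
    synthesize: Callable[[str], bytes]    # text-to-speech stage

    @staticmethod
    def build_prompt(question: str, snippets: List[str]) -> str:
        # Put the retrieved snippets ahead of the question so the model
        # can ground its answer in them (the "RAG" part of the flow).
        context = "\n".join(f"- {s}" for s in snippets)
        return f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"

    def answer(self, audio: bytes) -> bytes:
        question = self.transcribe(audio)
        snippets = self.search(question)
        prompt = self.build_prompt(question, snippets)
        reply = self.generate(prompt)
        return self.synthesize(reply)


# Stand-in stages so the sketch runs without any model or network access.
def fake_transcribe(audio: bytes) -> str:
    return audio.decode("utf-8")


def fake_search(query: str) -> List[str]:
    return [f"result about {query}"]


def fake_generate(prompt: str) -> str:
    return "ANSWER[" + prompt + "]"


def fake_synthesize(text: str) -> bytes:
    return text.encode("utf-8")


if __name__ == "__main__":
    pipeline = VoiceRagPipeline(
        fake_transcribe, fake_search, fake_generate, fake_synthesize
    )
    print(pipeline.answer(b"what is RAG").decode("utf-8"))
```

Passing the stages in as plain callables keeps each one swappable, so a different speech or language model can be substituted without touching the orchestration logic.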
Alternatives and similar repositories for Voice-chat-enabled-RAG-chatbot-with-real-time-internet-access
Users that are interested in Voice-chat-enabled-RAG-chatbot-with-real-time-internet-access are comparing it to the libraries listed below
- Multimodal AI App using Llava 7B and Gradio. ☆39 · Updated last year
- Talking head video AI generator ☆81 · Updated last year
- AI Lip Syncing application, deployed on Streamlit ☆43 · Updated last year
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio. ☆140 · Updated last year
- Performing a RAG (Retrieval Augmented Generation) assessment using voice-to-voice query resolution. Provide the file containing the queri… ☆44 · Updated last year
- Talk to GPT-4 and create a story together. ☆91 · Updated last year
- A general purpose AI voice assistant built using GPT-4. ☆33 · Updated 2 years ago
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local (CPU/GPU bound) and remote (I/O bound) to run. ☆88 · Updated this week
- Simple Chainlit UI for running LLMs locally using Ollama and LangChain ☆46 · Updated last year
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS … ☆14 · Updated 3 months ago
- A JS web client for connecting to Pipecat bots with voice and vision ☆44 · Updated 11 months ago
- Harness the power of NVIDIA technologies and LangChain to create dynamic avatars from live speech, integrating RIVA ASR and TTS with Audi… ☆95 · Updated last year
- Agent Studio is an AI agent application designed to handle real-time interactions through phone calls, web-based voice user interfaces (V… ☆47 · Updated last year
- A library for real-time Speech to Text (STT), and Text to Speech (TTS) capability ☆44 · Updated 2 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆97 · Updated last year
- Chatbot with a 3D avatar that can answer interview questions on your behalf. It can speak and understand English, German and Albanian. Ba… ☆37 · Updated 2 weeks ago
- Opinionated Langchain setup with Qdrant vector store and Kong gateway ☆33 · Updated 2 years ago
- Efficient approach to speaker diarization using voice characteristics extraction ☆105 · Updated 5 months ago
- Open Sourced NoteBookLM ☆59 · Updated last year
- AI Agents with Google's Gemini Pro and Gemini Pro Vision Models ☆28 · Updated last year
- Self-hosted AI voice agent ☆120 · Updated last year
- The objective of the Speaking Portal Project is to design, develop, and deploy a lip-sync animation API for the Kukarella text-to-speech … ☆13 · Updated 2 years ago
- Get started using Deepgram's Live Transcription with this Flask demo app ☆41 · Updated 2 weeks ago
- On-device real-time RAG App built using Jina Reader, Mediapipe, Gemma 2b IT LLM. ☆15 · Updated last year
- PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API ☆217 · Updated 3 months ago
- Chatbot web-applications with LLM, OpenAI API Assistants, LangChain, vector databases, and other AI stuff ☆25 · Updated last year
- Simli WebRTC AI Agent demo ☆23 · Updated 11 months ago
- OpenAI Whisper API-style local server, running on FastAPI ☆87 · Updated last month