A WebRTC server that lets you talk to an LLM using speech and responds with generated audio.
☆143Jun 18, 2024Updated last year
Alternatives and similar repositories for webrtc-ai-voice-chat
Users who are interested in webrtc-ai-voice-chat are comparing it to the libraries listed below
- Local SRT/LLM/TTS Voicechat☆759Oct 12, 2024Updated last year
- RAG Chatbot powered by Groq LPU, Ollama and Langchain☆13Mar 5, 2024Updated 2 years ago
- ☆24Sep 4, 2024Updated last year
- ☆11Nov 8, 2023Updated 2 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- ☆44Oct 8, 2024Updated last year
- Official code for Dense-TSNet☆12Sep 17, 2024Updated last year
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…☆11Jun 28, 2021Updated 4 years ago
- RAG Based LLM Chatbot Built using Open Source Stack (Llama 3.2 Model, BGE Embeddings, and Qdrant running locally within a Docker Containe…☆15Jan 9, 2025Updated last year
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local (CPU/GPU bound) and remote (I/O bound) to run.☆87Dec 28, 2025Updated 2 months ago
- Open Source framework for voice and multimodal conversational AI☆10,529Mar 3, 2026Updated last week
- This Streamlit application creates an interactive Data Visualization Assistant that can understand Natural Language Queries and generate …☆17Jan 13, 2025Updated last year
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Mar 27, 2024Updated last year
- Multi-Agent AI App from Scratch in Python without any framework dependency☆15Jan 7, 2025Updated last year
- Starter app for creating an AI task completion agent with gmail capabilities.☆27Jun 25, 2024Updated last year
- A RESTful API to access the entire Project Gutenberg catalogue.☆13Jun 26, 2024Updated last year
- Implementation of CGMM-MVDR beamforming used for Clarity challenge☆14Jan 14, 2022Updated 4 years ago
- ☆15Jun 27, 2023Updated 2 years ago
- Fine-tuning large language models (LLMs) is crucial for enhancing performance across domain-specific task applications. This comprehensiv…☆12Sep 19, 2024Updated last year
- This software contains an agent based on LangGraph & LangChain for solving general requests in the Whatsapp channel of this medical clini…☆216Oct 5, 2024Updated last year
- Example UI implementing the RTVI web client☆474Dec 3, 2024Updated last year
- ☆30Jun 12, 2025Updated 8 months ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆4,486Updated this week
- [WIP] Trying to implement "Ultra Low Complexity Deep Learning Based Noise Suppression." arXiv preprint arXiv:2312.08132 (2023).☆26May 29, 2024Updated last year
- Bookmarklet to pull and run hugging face GGUF models in Ollama☆17Oct 17, 2024Updated last year
- Have a natural voice conversation with an LLM☆262Jan 20, 2026Updated last month
- On-device voice activity detection (VAD) powered by deep learning☆245Mar 2, 2026Updated last week
- This is a balanced dataset for English homograph disambiguation (HD), generated with Meta's Llama 2-Chat 70B model.☆22Jan 22, 2024Updated 2 years ago
- AI narrator☆15Nov 24, 2023Updated 2 years ago
- ☆17Jun 14, 2024Updated last year
- Distillation of Self-Supervised Representation-Based Speech Quality Assessment☆44May 15, 2025Updated 9 months ago
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆14Dec 30, 2023Updated 2 years ago
- An agent to generate stunning images :)☆23May 22, 2025Updated 9 months ago
- A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network.☆15Aug 22, 2023Updated 2 years ago
- Joint speech-language model - respond directly to audio!☆372Jul 1, 2024Updated last year
- An intelligent agent utilizing Large Language Models (LLMs) for automated financial news retrieval and stock price prediction.☆21Sep 9, 2024Updated last year
- Agent-Friendly Web Principles☆29Oct 15, 2025Updated 4 months ago
- ☆88Mar 20, 2025Updated 11 months ago