freddyaboulton / gradio-webrtc
Realtime Video and Audio Streaming with WebRTC and Gradio
☆198 Updated this week
Alternatives and similar repositories for gradio-webrtc:
Users who are interested in gradio-webrtc are comparing it to the libraries listed below:
- Building Blocks for Multi-Modal Gradio Powered by Groq Apps ☆102 Updated 2 months ago
- Have a natural voice conversation with an LLM ☆235 Updated last month
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio. ☆120 Updated 7 months ago
- ☆172 Updated 5 months ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-o ☆38 Updated 3 months ago
- Gemini Multimodal Live + WebRTC in a single `app.ts` ☆176 Updated last month
- LLaSA: Scaling Train-time and Test-time Compute for LLaMA-based Speech Synthesis ☆230 Updated this week
- Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model". ☆868 Updated 3 months ago
- Collection of Open Source Speech Data ☆151 Updated 2 months ago
- Examples for Cerebrium Serverless GPUs ☆456 Updated this week
- Voice-Enabled Math Tutor Powered by Groq that Calculates and Renders Live Problems and Instruction with LaTeX in Seconds! ☆210 Updated last month
- Interface for OuteTTS models. ☆902 Updated this week
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine ☆426 Updated 2 weeks ago
- Open source alternative to Gemini Deep Research. Generate reports with AI based on search results. ☆245 Updated this week
- A simple client and utils for interacting with OpenAI's Realtime API in Python ☆208 Updated 2 months ago
- LlamaVoice is a llama-based large voice generation model, providing inference and training ability. ☆229 Updated 5 months ago
- Framework agnostic computer vision inference. Run 1000+ models by changing only one line of code. Supports models from transformers, timm… ☆130 Updated 2 months ago
- ☆172 Updated last year
- ⚡ Insanely fast AI voice assistant with <500ms response times ☆362 Updated last month
- ☆88 Updated last week
- ☆115 Updated 2 months ago
- Solving data for LLMs - Create quality synthetic datasets! ☆144 Updated last week
- Open source inference code for Rev's model ☆366 Updated last week
- 📋 NotebookMLX - An Open Source version of NotebookLM (Ported NotebookLlama) ☆231 Updated 2 months ago
- We Speech Transcript based on LLM, in 300 lines of code. ☆142 Updated this week
- Daily Bots Web Demo showcasing how to build real-time voice AI agents ☆194 Updated 3 months ago
- An opensource implementation of NotebookLM using Deepseek-V3 and PlayHT TTS. ☆209 Updated 3 weeks ago
- Speech Diarization for scrum automation ☆101 Updated last year
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration ☆114 Updated last week
- ☆289 Updated last month