realtime-ai / realtime-audio-sdk
Realtime Audio SDK for the Web — audio capture, echo cancellation (AEC), voice activity detection (VAD), and real-time encoding (Opus/PCM).
☆116Updated last month
Alternatives and similar repositories for realtime-audio-sdk
Users who are interested in realtime-audio-sdk are comparing it to the libraries listed below.
- A real-time Agent framework for audio and video.☆165Updated this week
- Super-fast LLM responses, using a small LLM to prefix a large LLM☆237Updated 3 months ago
- ☆50Updated 2 months ago
- coze api to openai☆15Updated last year
- 🎤💬 Full example of implementing ChatGPT's realtime voice from scratch with VAD + STT + LLM + TTS technology stack within almost one fil…☆133Updated 2 weeks ago
- ☆170Updated last year
- Trans Router☆166Updated 10 months ago
- Big map for Google I/O 2025☆31Updated 5 months ago
- ☆70Updated last year
- This project provides a RESTful API for converting text to speech using Microsoft's Azure Cognitive Services☆96Updated last year
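- SMS forwarder: monitors SMS messages, incoming calls, and app notifications on an Android phone and forwards them, according to specified rules, to other destinations: DingTalk group custom bots, DingTalk enterprise internal bots, WeCom group bots, Feishu bots, WeCom app messages, email, bark, webhook, Telegram bots, ServerChan, PushPlus, phone SMS…☆46Updated 2 years ago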
- How to get the best results: Improve-Your-Prompt is a prompt for optimizing prompts☆39Updated 11 months ago
- Intelligent video processing system☆48Updated 10 months ago
- AI-generated PPT based on RevealJS syntax☆145Updated last year
- An open-source Duolingo alternative. RIP Duo🕯️☆63Updated 9 months ago
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet. Powered by Vercel…☆124Updated last week
- A modern web app that uses Cloudflare's Browser Rendering feature to extract images from any website. Built with Remix and deployed on Cloudflare Pages.☆134Updated 11 months ago
- An MCP server for automated website deployment to 1Panel (Experimental)☆33Updated 4 months ago
- Talk to Type☆65Updated 4 months ago
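- PDF2MD is an efficient PDF-to-Markdown conversion tool designed to help users easily convert PDF documents to Markdown for editing, sharing, and publishing. With a simple, easy-to-use interface and powerful conversion features, PDF2MD is a capable assistant for content creators, researchers, and developers.☆172Updated last month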
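- Intelligent, automated generation of children's audiobooks, using the Tongyi Qianwen large model + Cosyvoice speech synthesis + Flux image generation + Paraformer speech recognition to produce production-ready children's audiobooks☆103Updated 2 months ago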
- ☆29Updated 7 months ago
- A class for generating realistic audio (TTS) for podcasts and dialogues.☆63Updated 11 months ago
- Model Context Protocol server for scraping Weibo user information, posts, and search functionality☆31Updated 2 months ago
- Chrome extension + electron + node + react => logic-flowchart web automation tool☆41Updated 3 weeks ago
- ☆33Updated last year
- from Google AI Studio☆145Updated last month
- ☆48Updated 7 months ago
- Self-hosted Whisper API system based on containers☆64Updated last year
- Records the key milestones in the development of AI technologies such as text-to-image, text-to-video, and large language models☆80Updated 3 months ago