di-osc / livekit-plugins-chineseLinks
livekit agent plugins
☆21Updated last month
Alternatives and similar repositories for livekit-plugins-chinese
Users that are interested in livekit-plugins-chinese are comparing it to the libraries listed below
Sorting:
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆103Updated 2 weeks ago
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆81Updated this week
- Dynamic Voice Actor Assignment and Emotional Narration for Realistic Story Play☆41Updated 6 months ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆34Updated 8 months ago
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment☆172Updated 8 months ago
- Streaming ASR and TTS based on FastAPI+ sherpa-onnx☆155Updated 5 months ago
- Real time faster whisper gradio☆26Updated 2 months ago
- Qwen-TTS offers a robust voice synthesis service using FastAPI, supporting bilingual and dialect options. Explore seamless audio generati…☆74Updated this week
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS …☆14Updated 2 months ago
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆28Updated last year
- ASR (Automatic Speech Recognition) for real-time streamed audio powered by Whisper and tranformers☆34Updated 10 months ago
- ☆14Updated 10 months ago
- Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models☆14Updated last year
- We Speech Transcript based on LLM, in 300 lines of code.☆177Updated 4 months ago
- A Full-Duplex Open-Domain Dialogue Agent with Continuous Turn-Taking Behavior☆33Updated last year
- 用于SenseVoice的api项目,输出带时间戳字幕☆41Updated 11 months ago
- The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"☆69Updated 2 months ago
- Real-time video understanding and interaction through text,audio,image and video with large multi-modal model. 利用多模态大模型的实时视频理解和交互框架,通过文本…☆25Updated last year
- xllamacpp - a Python wrapper of llama.cpp☆60Updated last week
- A lightweight end-to-end text-to-speech model☆121Updated 7 months ago
- A collection of optimized utilities for text-to-audio processing, enhancing both training and inference workflows. This repository contai…☆39Updated 6 months ago
- Add real-time Speech-to-Text to your LiveKit application with AssemblyAI☆17Updated 4 months ago
- Real time streaming digital human based on nerf☆17Updated last year
- ☆166Updated 10 months ago
- [EMNLP 2025 Demo] PresentAgent: Multimodal Agent for Presentation Video Generation☆106Updated 2 weeks ago
- ☆10Updated 2 years ago
- funasr语音转文字的简单api版本,funasr+fastapi,方便部署在服务器上☆13Updated last year
- 简单实现VAD+声纹锁+SenseVoice完成类语音实时转录的小项目☆38Updated last year
- openai realtime webrtc python client☆45Updated 9 months ago
- ASR using OpenAI capability API `v1/audio/transcriptions` like Groq, SiliconFlow☆32Updated last year