di-osc / livekit-plugins-chinese
LiveKit agent plugins
☆33 Updated 3 weeks ago
Alternatives and similar repositories for livekit-plugins-chinese
Users that are interested in livekit-plugins-chinese are comparing it to the libraries listed below
- An open source chatbot architecture for voice/vision (and multimodal) assistants that runs locally (CPU/GPU bound) or remotely (I/O bound). ☆87 Updated last month
- SenseVoice-python: An enterprise-grade open source multi-language ASR system from FunASR, running on onnxruntime ☆108 Updated 3 months ago
- Streaming ASR and TTS based on FastAPI + sherpa-onnx ☆187 Updated 2 months ago
- DINet-based inference service for video streams and video files ☆16 Updated 2 years ago
- Dynamic Voice Actor Assignment and Emotional Narration for Realistic Story Play ☆47 Updated 9 months ago
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment ☆172 Updated 11 months ago
- An API project for SenseVoice that outputs timestamped subtitles ☆43 Updated last year
- Digital human code based on MuseTalk. ☆35 Updated last year
- Simple and fast wav2lip using onnx models for face-detection and inference. Easy installation ☆28 Updated last year
- Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models ☆15 Updated 2 years ago
- [EMNLP 2025 Demo] PresentAgent: Multimodal Agent for Presentation Video Generation ☆128 Updated 2 months ago
- A small project that combines VAD, a voiceprint lock, and SenseVoice for near-real-time speech transcription ☆42 Updated last year
- PersonaTalk Hack ☆15 Updated last year
- This repository provides a Docker image for CosyVoice ☆27 Updated last year
- This project fixes the Wav2Lip project so that it can run on Python 3.9. Wav2Lip is a project that can be used to lip-sync videos to audi… ☆17 Updated 2 years ago
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS … ☆14 Updated 5 months ago
- Generate ARKit expression from audio in realtime ☆184 Updated 3 months ago
- This is a project about talking faces. We use 576X576 sized facial images for training, which can generate 2k, 4k, 6k, and 8k digital hum… ☆55 Updated last year
- A Full-Duplex Open-Domain Dialogue Agent with Continuous Turn-Taking Behavior ☆36 Updated 2 years ago
- ☆146 Updated last month
- ☆12 Updated 2 years ago
- Just a suturing monster project. ☆42 Updated 2 years ago
- ☆14 Updated last year
- project page for ChatAnyone ☆116 Updated 10 months ago
- A collection of optimized utilities for text-to-audio processing, enhancing both training and inference workflows. This repository contai… ☆42 Updated 10 months ago
- ☆82 Updated 3 weeks ago
- optimized wav2lip ☆18 Updated 2 years ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task… ☆33 Updated 11 months ago
- Running the F5-TTS by ONNX Runtime ☆191 Updated 3 weeks ago
- Real time faster whisper gradio ☆25 Updated 5 months ago