TEN-framework / ten-vad
A Low-Latency, Lightweight and High-Performance Streaming VAD
☆166Updated this week
Alternatives and similar repositories for ten-vad
Users that are interested in ten-vad are comparing it to the libraries listed below
Sorting:
- Turn detection for full-duplex dialogue communication☆59Updated this week
- We Speech Transcript based on LLM, in 300 lines of code.☆160Updated 3 weeks ago
- A toolkit for speaker diarization.☆188Updated this week
- A lightweight end-to-end text-to-speech model☆114Updated 2 months ago
- ☆158Updated 5 months ago
- ☆375Updated this week
- OSUM: Open Speech Understanding Model, open-sourced by ASLP@NPU.☆363Updated this week
- Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.☆480Updated last week
- 🎤💬 Full example of implementing ChatGPT's realtime voice from scratch with VAD + STT + LLM + TTS technology stack within almost one fil…☆88Updated last week
- A real-time AI development framework leveraging WebRTC for audio and video transmission.☆131Updated 3 months ago
- GPT-4o-level, real-time spoken dialogue system.☆324Updated 3 months ago
- openai realtime webrtc python client☆42Updated 4 months ago
- Open source inference code for Rev's model☆401Updated 3 weeks ago
- Added vLLM support to IndexTTS for faster inference.☆87Updated this week
- ☆304Updated last week
- This project provides a RESTful API for converting text to speech using Microsoft's Azure Cognitive Services☆95Updated 11 months ago
- ☆195Updated 7 months ago
- Dynamic Voice Actor Assignment and Emotional Narration for Realistic Story Play☆40Updated last month
- RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios☆53Updated 5 months ago
- Streaming ASR and TTS based on FastAPI+ sherpa-onnx☆105Updated 3 weeks ago
- Speech Diarization for scrum automation☆104Updated last year
- Using APPL to reimplement popular algorithms for Large Language Models (LLMs) and prompts☆44Updated 4 months ago
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆91Updated 7 months ago
- Trans Router☆162Updated 4 months ago
- self hosted whisper api system based on container☆63Updated 8 months ago
- ☆16Updated 6 months ago
- LlamaVoice is a llama-based large voice generation model, providing inference and training ability.☆233Updated 8 months ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆100Updated 2 years ago
- ☆156Updated 6 months ago
- A unified interface for multiple Text-to-Speech (TTS) providers.☆268Updated 4 months ago