stepfun-ai / Step-Realtime-ConsoleLinks
Step-Realtime-Console
☆52Updated last month
Alternatives and similar repositories for Step-Realtime-Console
Users that are interested in Step-Realtime-Console are comparing it to the libraries listed below
Sorting:
- GPT-4o-level, real-time spoken dialogue system.☆359Updated 8 months ago
- ☆204Updated last year
- 一个用于CosyVoice的api接口项目☆313Updated last month
- Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.☆644Updated 3 months ago
- MOSS-TTSD is a spoken dialogue generation model that enables expressive dialogue speech synthesis in both Chinese and English, supporting…☆984Updated 3 weeks ago
- ☆459Updated 5 months ago
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment☆172Updated 8 months ago
- 实时STT ,连接OpenAI接口/智谱AI(流式LLM)和GPT-SOVITS/Edge-TTS,通过网页的方式,进行跨网络的服务调用,实现实时对话的效果☆420Updated 9 months ago
- Pseudo Streaming SenseVoice with Hotwords☆366Updated 7 months ago
- Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation…☆1,174Updated last month
- ☆370Updated last year
- RTC AIGC Demo☆211Updated 3 weeks ago
- ☆466Updated 5 months ago
- ☆52Updated last month
- ☆320Updated 6 months ago
- 使用vllm加速cosyvoice2的推理☆430Updated 5 months ago
- Sample Repository for the AlibabaCloud Bailian Speech SDK☆293Updated 3 weeks ago
- Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.☆456Updated 11 months ago
- 基于SparkTTS、OrpheusTTS等模型,提供高质量中文语音合成与声音克隆服务。☆540Updated 5 months ago
- Streaming ASR and TTS based on FastAPI+ sherpa-onnx☆155Updated 6 months ago
- Added vLLM support to IndexTTS for faster inference.☆752Updated this week
- Long-form streaming TTS system for multi-speaker dialogue generation☆832Updated last week
- ☆785Updated last year
- 基于通义千问 Qwen2.5-Omni 的实时语音对话系统,使用在线API服务,支持实时语音交互、动态语音活动检测和流式音频处理。A real-time voice conversation system based on Qwen2.5-Omni Online-API, …☆76Updated 5 months ago
- RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios☆69Updated 3 months ago
- Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching☆674Updated last month
- We Speech Transcript based on LLM, in 300 lines of code.☆177Updated 4 months ago
- A toolkit for speaker diarization.☆311Updated 2 weeks ago
- OSUM & OSUM-EChat, open speech understanding model and empathetic spoken chatbot based on it, open-sourced by ASLP@NPU.☆442Updated 2 weeks ago
- CosyVoice2 功能扩充(预训练音色推理/3s极速复刻/自然语言控制/自动识别/音色模型保存/API)☆170Updated 7 months ago