gpustack / vox-box
A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.
☆109Updated 3 weeks ago
Alternatives and similar repositories for vox-box
Users that are interested in vox-box are comparing it to the libraries listed below
Sorting:
- LM inference server implementation based on *.cpp.☆185Updated this week
- xllamacpp - a Python wrapper of llama.cpp☆36Updated last week
- ☆88Updated 2 months ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆35Updated 3 months ago
- Real time faster whisper gradio☆26Updated 7 months ago
- Auto Thinking Mode switch for Qwen3 in Open webui☆50Updated this week
- Review/Check GGUF files and estimate the memory usage and maximum tokens per second.☆161Updated this week
- Streaming ASR and TTS based on FastAPI+ sherpa-onnx☆103Updated 3 weeks ago
- ☆142Updated 2 months ago
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS …☆12Updated last month
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆47Updated this week
- Jina DeepSearch UI☆104Updated this week
- Get up and running with Llama 3, Mistral, Gemma, and other large language models.☆26Updated this week
- ASR using OpenAI capability API `v1/audio/transcriptions` like Groq, SiliconFlow☆31Updated 8 months ago
- The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise s…☆78Updated 4 months ago
- ☆289Updated last week
- Port of Facebook's LLaMA model in C/C++☆52Updated 2 weeks ago
- Cross Platform Open Sourced Chinese NoteBookLM app based on Electron, Use DeepSeek + Reecho.ai☆72Updated 6 months ago
- Receipts for creating AI Applications with APIs from DashScope (and friends)!☆51Updated 7 months ago
- ChatTTS-OpenAI-API is a project built upon the ChatTTS project, implementing the v1/audio/speech endpoint in compliance with OpenAI proto…☆21Updated 11 months ago
- WIP. Apps (100+) + AI.☆28Updated 8 months ago
- bisheng-unstructured library☆46Updated last week
- ☆53Updated 4 months ago
- A simple wrapper around "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" that provides an OpenAI-compatibl…☆12Updated 3 months ago
- Full list of LLM API with Internet Access☆70Updated 3 months ago
- TEaR framework for paper "TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement"☆44Updated 8 months ago
- ☆142Updated this week
- The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"☆33Updated this week
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆56Updated 5 months ago
- LLM voice chat project by Connect ChatTTS with Local Ollama, 连接本地部署的 Ollama 和 ChatTTS,实现和LLM的语音对话☆62Updated 9 months ago