gpustack / vox-box
A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.
☆99Updated last week
Alternatives and similar repositories for vox-box:
Users that are interested in vox-box are comparing it to the libraries listed below
- LM inference server implementation based on *.cpp.☆169Updated this week
- xllamacpp - a Python wrapper of llama.cpp☆34Updated last week
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆35Updated 2 months ago
- Real time faster whisper gradio☆26Updated 6 months ago
- ☆85Updated last month
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆27Updated 7 months ago
- Review/Check GGUF files and estimate the memory usage and maximum tokens per second.☆145Updated last week
- Get up and running with Llama 3, Mistral, Gemma, and other large language models.☆26Updated last week
- Cross Platform Open Sourced Chinese NoteBookLM app based on Electron, Use DeepSeek + Reecho.ai☆71Updated 5 months ago
- GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型☆21Updated last week
- The latest graphrag interface is used, using the local ollama to provide the LLM interface.Support for using the pip installation☆146Updated 6 months ago
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS …☆12Updated last month
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆56Updated 4 months ago
- A simple wrapper around "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" that provides an OpenAI-compatibl…☆12Updated 2 months ago
- WIP. Apps (100+) + AI.☆28Updated 7 months ago
- MinerU API server☆51Updated 4 months ago
- Streaming ASR and TTS based on FastAPI+ sherpa-onnx☆95Updated 6 months ago
- bisheng-unstructured library☆44Updated last week
- ASR using OpenAI capability API `v1/audio/transcriptions` like Groq, SiliconFlow☆30Updated 7 months ago
- ☆140Updated 2 months ago
- ☆24Updated 3 months ago
- Jina DeepSearch UI☆95Updated last week
- Receipts for creating AI Applications with APIs from DashScope (and friends)!☆51Updated 6 months ago
- automatically quant GGUF models☆167Updated last week
- Try out HallOumi, a state-of-the-art claim verification model in a simple UI!☆30Updated 2 weeks ago
- The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise s…☆78Updated 3 months ago
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆88Updated 6 months ago
- Speech Diarization for scrum automation☆102Updated last year
- Chrome extension to add a link from each Arxiv page to the corresponding HF Paper page☆25Updated last year
- ☆28Updated 6 months ago