liu-qingyuan / faster_whisper_gradioLinks
Real time faster whisper gradio
☆26Updated 2 months ago
Alternatives and similar repositories for faster_whisper_gradio
Users that are interested in faster_whisper_gradio are comparing it to the libraries listed below
Sorting:
- Have a natural voice conversation with an LLM☆258Updated 2 weeks ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆34Updated 8 months ago
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆28Updated last year
- Cross Platform Open Sourced Chinese NoteBookLM app based on Electron, Use DeepSeek + Reecho.ai☆79Updated 11 months ago
- A gradio webui for Andrewyng translation-agent☆30Updated 10 months ago
- The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"☆69Updated 2 months ago
- Streaming ASR and TTS based on FastAPI+ sherpa-onnx☆156Updated 6 months ago
- xllamacpp - a Python wrapper of llama.cpp☆60Updated 2 weeks ago
- ☆21Updated 11 months ago
- Jina DeepSearch UI☆126Updated 2 months ago
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆165Updated 3 months ago
- a Dify plugin to convert markdown text into .pptx file☆20Updated 7 months ago
- AI tool for auto-research, TTS, and Graphical assembly into a completed Podcast☆81Updated 3 months ago
- The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise s…☆80Updated 10 months ago
- Datalore is an AI-powered Data Analysis tool that integrates Anthropic's Claude API with various data analysis libraries and custom funct…☆42Updated 8 months ago
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆83Updated this week
- openai realtime webrtc python client☆45Updated 9 months ago
- g1: Using GPT-4o to create o1-like reasoning chains☆20Updated last year
- Async MCP server with Minimax API integration for image generation and text-to-speech☆51Updated last month
- ☆77Updated 6 months ago
- 我们是第一个完全可商用的角色大模型。☆40Updated last year
- ☆166Updated 10 months ago
- This is a multi-character, ultra-personalized StoryTeller. It includes: 1) efficiently and accurately build multi-character voice library…☆55Updated 8 months ago
- A FastAPI service for text-to-speech synthesis using the F5-TTS model. Includes authentication token☆34Updated 6 months ago
- 基于OpenVoice和Melotts整合的中文版webui,添加resemble_enhance音频增强功能☆97Updated last year
- 与 https://github.com/tonori/mem0ai-api 配合使用的非官方的 mem0ai provider.☆47Updated last year
- an open source ai stylist☆75Updated 3 months ago
- Examples for QinYan GLMs☆13Updated last year
- 基于通义千问 Qwen2.5-Omni 的实时语音对话系统,使用在线API服务,支持实时语音交互、动态语音活 动检测和流式音频处理。A real-time voice conversation system based on Qwen2.5-Omni Online-API, …☆77Updated 5 months ago
- Code for ACL25-findings. An LLM-based agent simulation framework that simulates human behavior and generates dynamic, text-based social g…☆84Updated 2 months ago