liu-qingyuan / faster_whisper_gradio
Real time faster whisper gradio
☆25Updated 4 months ago
Alternatives and similar repositories for faster_whisper_gradio
Users that are interested in faster_whisper_gradio are comparing it to the libraries listed below
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆33Updated 11 months ago
- Have a natural voice conversation with an LLM☆261Updated 3 months ago
- Cross-platform, open-source Chinese NoteBookLM app based on Electron, using DeepSeek + Reecho.ai☆82Updated last year
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆29Updated last year
- A gradio webui for Andrewyng translation-agent☆30Updated last year
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆86Updated 2 weeks ago
- This is a multi-character, ultra-personalized StoryTeller. It includes: 1) efficiently and accurately build multi-character voice library…☆58Updated 11 months ago
- ☆21Updated last year
- Streaming ASR and TTS based on FastAPI + sherpa-onnx☆184Updated 2 months ago
- A FastAPI service for text-to-speech synthesis using the F5-TTS model. Includes authentication token☆36Updated 8 months ago
- Jina DeepSearch UI☆127Updated 4 months ago
- An API project for SenseVoice that outputs subtitles with timestamps☆43Updated last year
- AI tool for auto-research, TTS, and Graphical assembly into a completed Podcast☆84Updated 5 months ago
- An API and web UI project for F5-TTS☆65Updated last year
- The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise s…☆82Updated last year
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment☆173Updated 11 months ago
- openai realtime webrtc python client☆47Updated last year
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆187Updated 3 weeks ago
- an open source ai stylist☆77Updated 6 months ago
- g1: Using GPT-4o to create o1-like reasoning chains☆20Updated last year
- Datalore is an AI-powered Data Analysis tool that integrates Anthropic's Claude API with various data analysis libraries and custom funct…☆42Updated 10 months ago
- An agentic workflow for story book generation☆31Updated 10 months ago
- a Dify plugin to convert markdown text into .pptx file☆25Updated 9 months ago
- Async MCP server with Minimax API integration for image generation and text-to-speech☆51Updated 2 months ago
- The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"☆71Updated 5 months ago
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper.☆141Updated 4 months ago
- A Chinese-language web UI integrating OpenVoice and Melotts, with resemble_enhance audio enhancement added☆99Updated last year
- ☆112Updated last year
- A NextJS based app that takes a user prompt, or a YouTube url, or a Website URL, and generates a beautiful Mindmap.☆123Updated 10 months ago
- Uses the latest graphrag interface, with a local ollama providing the LLM interface. Supports installation via pip☆156Updated last year