liu-qingyuan / faster_whisper_gradioLinks
Real time faster whisper gradio
☆26Updated 9 months ago
Alternatives and similar repositories for faster_whisper_gradio
Users that are interested in faster_whisper_gradio are comparing it to the libraries listed below
Sorting:
- Have a natural voice conversation with an LLM☆252Updated 7 months ago
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆28Updated 10 months ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆35Updated 5 months ago
- Jina DeepSearch UI☆120Updated last month
- A NextJS based app that takes a user prompt, or a YouTube url, or a Website URL, and generates a beautiful Mindmap.☆118Updated 5 months ago
- Cross Platform Open Sourced Chinese NoteBookLM app based on Electron, Use DeepSeek + Reecho.ai☆75Updated 9 months ago
- Datalore is an AI-powered Data Analysis tool that integrates Anthropic's Claude API with various data analysis libraries and custom funct…☆40Updated 5 months ago
- a Dify plugin to convert markdown text into .pptx file☆19Updated 4 months ago
- Streaming ASR and TTS based on FastAPI+ sherpa-onnx☆134Updated 3 months ago
- AI tool for auto-research, TTS, and Graphical assembly into a completed Podcast☆78Updated 2 weeks ago
- The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"☆65Updated 2 months ago
- An agentic workflow for story book generation☆30Updated 4 months ago
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆64Updated this week
- ☆19Updated 8 months ago
- A gradio webui for Andrewyng translation-agent☆29Updated 8 months ago
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆146Updated 2 weeks ago
- 用于SenseVoice的api项目,输出带时间戳字幕☆38Updated 9 months ago
- g1: Using GPT-4o to create o1-like reasoning chains☆20Updated 10 months ago
- Building Blocks for Multi-Modal Gradio Powered by Groq Apps☆112Updated 9 months ago
- 一个基于Together AI的强大图像生成工具,支持文生图、图生图和提示词分析功能。☆24Updated 8 months ago
- Examples for QinYan GLMs☆13Updated 11 months ago
- openai realtime webrtc python client☆45Updated 7 months ago
- ☆77Updated 3 months ago
- 一个用于F5-TTS的api和webui项目☆61Updated 7 months ago
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper.☆123Updated 10 months ago
- 基于OpenVoice和Melotts整合的中文版webui,添加resemble_enhance音频增强功能☆96Updated last year
- An LLM-based agent simulation framework that simulates human behavior and generates dynamic, text-based social graphs.☆80Updated 3 weeks ago
- an open source ai stylist☆67Updated last month
- RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios☆62Updated last month
- Async MCP server with Minimax API integration for image generation and text-to-speech☆49Updated last week