liu-qingyuan / faster_whisper_gradio

Real-time faster-whisper with Gradio

☆26

Alternatives and similar repositories for faster_whisper_gradio

Users who are interested in faster_whisper_gradio are comparing it to the repositories listed below.

Finity-Alpha / OpenVoiceChat
Have a natural voice conversation with an LLM
☆252 Updated 7 months ago
AgentEra / Agently-Talk-to-Control
An open-source agentic workflow framework and showcase for controlling actions through chat text, powered by the Agently AI application development framework.
☆28 Updated 10 months ago
LB-Young / Bambo
Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…
☆35 Updated 5 months ago
jina-ai / deepsearch-ui
Jina DeepSearch UI
☆120 Updated last month
misbahsy / MindMapper
A Next.js-based app that takes a user prompt, a YouTube URL, or a website URL and generates a beautiful mind map.
☆118 Updated 5 months ago
dubeno / NotebookLLM-Chinese
Cross-platform, open-source Chinese NotebookLM app based on Electron, using DeepSeek and Reecho.ai
☆75 Updated 9 months ago
micic-mihajlo / Datalore
Datalore is an AI-powered Data Analysis tool that integrates Anthropic's Claude API with various data analysis libraries and custom funct…
☆40 Updated 5 months ago
stvlynn / PPT-Dify-Plugin
A Dify plugin that converts Markdown text into a .pptx file
☆19 Updated 4 months ago
ruzhila / voiceapi
Streaming ASR and TTS based on FastAPI and sherpa-onnx
☆134 Updated 3 months ago
ETomberg391 / Ecne-AI-Podcaster
AI tool for automated research, TTS, and graphical assembly into a completed podcast
☆78 Updated 2 weeks ago
chentuochao / Spatial-Speech-Translation
The official repo for the paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"
☆65 Updated 2 months ago
breakstring / Agentic_Story_Book_Workflow
An agentic workflow for story book generation
☆30 Updated 4 months ago
ai-bot-pro / achatbot
An open-source chatbot architecture for voice/vision (and multimodal) assistants that runs locally (CPU/GPU bound) or remotely (I/O bound).
☆64 Updated this week
leduclinh7141 / BetterWhisperX
☆19 Updated 8 months ago
snekkenull / translation-agent-webui
A Gradio web UI for andrewyng's translation-agent
☆29 Updated 8 months ago
gpustack / vox-box
A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.
☆146 Updated 2 weeks ago
jianchang512 / sense-api
An API project for SenseVoice that outputs subtitles with timestamps
☆38 Updated 9 months ago
Theigrams / g1
g1: Using GPT-4o to create o1-like reasoning chains
☆20 Updated 10 months ago
bklieger-groq / gradio-groq-basics
Building blocks for multimodal Gradio apps powered by Groq
☆112 Updated 9 months ago
nicekate / Together-Flux-Studio
A powerful image generation tool based on Together AI, supporting text-to-image, image-to-image, and prompt analysis.
☆24 Updated 8 months ago
MetaGLM / qingyan-cookbook
Examples for QinYan GLMs
☆13 Updated 11 months ago
realtime-ai / openai-realtime-webrtc-python
OpenAI Realtime WebRTC Python client
☆45 Updated 7 months ago
maitrix-org / easyweb
☆77 Updated 3 months ago
jianchang512 / f5-tts-api
An API and web UI project for F5-TTS
☆61 Updated 7 months ago
AlexisBalayre / AI-Powered-Meeting-Summarizer
Gradio-powered application that uses Whisper to convert audio recordings of meetings into transcripts and provides concise summaries.
☆123 Updated 10 months ago
v3ucn / OpenVoiceV2_Webui_resemble_enhance
A Chinese-language web UI that integrates OpenVoice and MeloTTS, with added resemble_enhance audio enhancement
☆96 Updated last year
Ji-Cather / GraphAgent
An LLM-based agent simulation framework that simulates human behavior and generates dynamic, text-based social graphs.
☆80 Updated 3 weeks ago
parsakhaz / open-ai-stylist
An open-source AI stylist
☆67 Updated last month
byteresearchcla / RealSI
RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios
☆62 Updated last month
PsychArch / minimax-mcp-tools
Async MCP server with Minimax API integration for image generation and text-to-speech
☆49 Updated last week