liu-qingyuan / faster_whisper_gradioLinks
Real time faster whisper gradio
☆25Updated 3 months ago
Alternatives and similar repositories for faster_whisper_gradio
Users that are interested in faster_whisper_gradio are comparing it to the libraries listed below
Sorting:
- Have a natural voice conversation with an LLM☆258Updated last month
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆34Updated 9 months ago
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆29Updated last year
- 用于SenseVoice的api项目,输出带时间戳字幕☆43Updated last year
- Streaming ASR and TTS based on FastAPI+ sherpa-onnx☆169Updated last month
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆88Updated this week
- Cross Platform Open Sourced Chinese NoteBookLM app based on Electron, Use DeepSeek + Reecho.ai☆80Updated last year
- AI tool for auto-research, TTS, and Graphical assembly into a completed Podcast☆82Updated 4 months ago
- This is a multi-character, ultra-personalized StoryTeller. It includes: 1) efficiently and accurately build multi-character voice library…☆57Updated 10 months ago
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆177Updated 4 months ago
- The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"☆70Updated 3 months ago
- 基于OpenVoice和Melotts整合的中文版webui,添加resemble_enhance音频增强功能☆99Updated last year
- ☆21Updated last year
- A NextJS based app that takes a user prompt, or a YouTube url, or a Website URL, and generates a beautiful Mindmap.☆122Updated 9 months ago
- openai realtime webrtc python client☆46Updated 11 months ago
- ASR using OpenAI capability API `v1/audio/transcriptions` like Groq, SiliconFlow☆32Updated last year
- xllamacpp - a Python wrapper of llama.cpp☆66Updated last week
- The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise s…☆81Updated 11 months ago
- g1: Using GPT-4o to create o1-like reasoning chains☆20Updated last year
- Examples for QinYan GLMs☆13Updated last year
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper.☆140Updated 3 months ago
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment☆173Updated 9 months ago
- 一个用于F5-TTS的api和webui项目☆64Updated 11 months ago
- 🎧 Pod-Helper: Real-time audio transcription and repair on consumer hardware☆77Updated last year
- ☆12Updated last year
- Jina DeepSearch UI☆126Updated 3 months ago
- coze api to openai☆15Updated last year
- A gradio webui for Andrewyng translation-agent☆30Updated last year
- Code for ACL25-findings. An LLM-based agent simulation framework that simulates human behavior and generates dynamic, text-based social g…☆88Updated last month
- Chrome / Edge extension to turn arXiv papers into Markdown codes in one click.☆86Updated 8 months ago