tdolan21 / openai-whisper-v3-api
FastAPI + Streamlit interface for OpenAI Whisper-large-v3 with youtube-to-mp3
☆27Updated 2 years ago
Alternatives and similar repositories for openai-whisper-v3-api
Users who are interested in openai-whisper-v3-api are comparing it to the libraries listed below.
- Have a natural voice conversation with an LLM☆259Updated 2 months ago
- An open source chatbot architecture for voice/vision (and multimodal) assistants that can run locally (CPU/GPU bound) or remotely (I/O bound).☆87Updated this week
- ☆121Updated last year
- ☆85Updated 2 years ago
- Real time faster whisper gradio☆25Updated 4 months ago
- You can play with any API server that is compatible with the OpenAI API☆24Updated last year
- Python implementation of AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, w…☆48Updated 8 months ago
- Uses the latest graphrag interface, with local ollama providing the LLM interface. Supports installation via pip.☆155Updated last year
- A real-time voice conversation system based on Tongyi Qianwen Qwen2.5-Omni, using the online API service; supports real-time voice interaction, dynamic voice activity detection, and streaming audio processing.☆82Updated 7 months ago
- ☆198Updated last year
- Tutorials from AutoGen Basics to Use Cases☆33Updated 2 years ago
- a Dify tool for storing and retrieving long-term-memory, using Dify built-in Knowledge dataset for storing memories, each user has a stan…☆92Updated last year
- ☆57Updated last year
- Automatically tracks hot GitHub repos, updated daily☆50Updated this week
- Jina DeepSearch UI☆126Updated 3 months ago
- ☆60Updated this week
- Local Powerpointer - A beautiful PowerPoint generator which uses the power of locally running large language models to generate the powerpo…☆283Updated 6 months ago
- Open Source Text Embedding Models with OpenAI Compatible API☆164Updated last year
- A gradio webui for Andrewyng translation-agent☆30Updated last year
- ☆76Updated last year
- 🌟 Revolutionize Your Operations with One Sentence Automation: Utilizing large language models and Multi-Agents to generate operational c…☆57Updated 2 years ago
- REFRAG-style RAG (compress → sense/select → expand) — Single-file reference implementation☆180Updated last week
- ASR + diarization model server with speculative decoding☆63Updated last year
- Effortlessly record, transcribe, and summarize meetings with this user-friendly desktop utility powered by OpenAI's Whisper and GPT-3.5-t…☆186Updated 2 years ago
- The first Chinese-language Llama 2 13B model (Base + Chinese dialogue SFT, enabling fluent multi-turn human-machine natural-language interaction)☆91Updated 2 years ago
- ☆252Updated last year
- MinerU API server☆83Updated last year
- bisheng-unstructured library☆56Updated 7 months ago
- Building Blocks for Multi-Modal Gradio Powered by Groq Apps☆115Updated last year
- ☆74Updated last year