tdolan21 / openai-whisper-v3-api
FastAPI + Streamlit interface for OpenAI Whisper-large-v3 with youtube-to-mp3
☆26Updated last year
Alternatives and similar repositories for openai-whisper-v3-api
Users that are interested in openai-whisper-v3-api are comparing it to the libraries listed below
- An open source chatbot architecture for voice/vision (and multimodal) assistants that can run locally (CPU/GPU bound) or remotely (I/O bound).☆81Updated this week
- Have a natural voice conversation with an LLM☆257Updated last week
- ☆120Updated last year
- ☆85Updated 2 years ago
- Automatically tracks trending GitHub repos, updated daily☆49Updated this week
- ☆103Updated last year
- Tutorials from AutoGen Basics to Use Cases☆32Updated last year
- A real-time voice conversation system based on Tongyi Qianwen Qwen2.5-Omni, using the online API service, with support for real-time voice interaction, dynamic voice activity detection, and streaming audio processing.☆76Updated 5 months ago
- ☆197Updated last year
- ☆59Updated this week
- A Gradio web UI for andrewyng's translation-agent☆30Updated 10 months ago
- Real time faster whisper gradio☆26Updated 2 months ago
- An enterprise-grade AI retriever designed to streamline AI integration into your applications, ensuring cutting-edge accuracy.☆290Updated 3 months ago
- Play with any API server that is compatible with the OpenAI API☆24Updated last year
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆17Updated last year
- ASR + diarization model server with speculative decoding☆63Updated last year
- Uses the latest graphrag interface, with local ollama providing the LLM interface. Supports installation via pip.☆155Updated last year
- A Dify tool for storing and retrieving long-term memory, using Dify's built-in Knowledge dataset to store memories; each user has a stan…☆91Updated last year
- SmolDocling OCR App built using SmolDocling 256M Model and Streamlit.☆167Updated 6 months ago
- Open Source Text Embedding Models with OpenAI Compatible API☆160Updated last year
- ☆112Updated last year
- MinerU API server☆74Updated 9 months ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆34Updated 8 months ago
- 🌟 Revolutionize Your Operations with One Sentence Automation: Utilizing large language models and Multi-Agents to generate operational c…☆56Updated last year
- ☆76Updated last year
- ☆57Updated last year
- Multimodal RAG with PyMuPDF☆40Updated last year
- Local Powerpointer - A beautiful powerpoint generator which uses the power of local running large language models to generate the powerpo…☆275Updated 4 months ago
- An unofficial mem0ai provider for use with https://github.com/tonori/mem0ai-api.☆47Updated last year
- ☆74Updated last year