tdolan21 / openai-whisper-v3-api
FastAPI + Streamlit interface for OpenAI Whisper-large-v3 with youtube-to-mp3
☆23Updated last year
Alternatives and similar repositories for openai-whisper-v3-api:
Users that are interested in openai-whisper-v3-api are comparing it to the libraries listed below
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆33Updated this week
- bisheng-unstructured library☆42Updated last week
- ☆74Updated 11 months ago
- Real time faster whisper gradio☆26Updated 5 months ago
- ☆173Updated last year
- 研究GOT-OCR-项目落地加 速,不限语言☆59Updated 5 months ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆35Updated last month
- GPT+神器,简单实用的一站式AGI架构,内置本地化,LLM模型,agent,矢量数据库,智能链chain☆48Updated last year
- Tutorials from AutoGen Basics to Use Cases☆29Updated last year
- Tracking the hot Github repos and update daily 每天自动追踪Github热门项目☆47Updated this week
- AGI 模块库架构图☆75Updated last year
- ☆82Updated last year
- 与 https://github.com/tonori/mem0ai-api 配合使用的非官方的 mem0ai provider.☆49Updated 8 months ago
- Multimodal LLM Application with PyMuPDF4LLM☆36Updated 5 months ago
- The latest graphrag interface is used, using the local ollama to provide the LLM interface.Support for using the pip installation☆143Updated 5 months ago
- ☆26Updated 5 months ago
- Agentica: Effortlessly Build Intelligent, Reflective, and Collaborative Multimodal AI Agents! 轻松构建智能、具备反思能力、可协作的多模态AI Agent。☆144Updated 3 weeks ago
- ☆59Updated 8 months ago
- ☆109Updated 7 months ago
- Have a natural voice conversation with an LLM☆246Updated 3 months ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆126Updated 9 months ago
- 阅读顺序、Layoutreader☆11Updated 10 months ago
- ☆58Updated 5 months ago
- Analysis of Chinese and English layouts 中英文版面分析☆185Updated last week
- Simple example to showcase how to use llamaparser to parse PDF files☆83Updated 6 months ago
- 演示 vllm 对中文大语言模型的神奇效果☆31Updated last year
- TianMu: A modern AI tool with multi-platform support, markdown support, multimodal, continuous conversation, and customizable commands. 一…☆83Updated last year
- A function calling tool can be deployed to Cloudflare Workers with openapi schema☆90Updated 8 months ago
- Chat with any website on your local machine☆72Updated 9 months ago
- ☆59Updated 5 months ago