tdolan21 / openai-whisper-v3-apiLinks
FastAPI + Streamlit interface for OpenAI Whisper-large-v3 with youtube-to-mp3
☆25Updated last year
Alternatives and similar repositories for openai-whisper-v3-api
Users that are interested in openai-whisper-v3-api are comparing it to the libraries listed below
Sorting:
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆68Updated this week
- Have a natural voice conversation with an LLM☆255Updated 8 months ago
- The latest graphrag interface is used, using the local ollama to provide the LLM interface.Support for using the pip installation☆154Updated 10 months ago
- ☆119Updated last year
- ☆198Updated last year
- ☆57Updated 10 months ago
- Real time faster whisper gradio☆26Updated 2 weeks ago
- SmolDocling OCR App built using SmolDocling 256M Model and Streamlit.☆158Updated 5 months ago
- An enterprise-grade AI retriever designed to streamline AI integration into your applications, ensuring cutting-edge accuracy.☆290Updated 2 months ago
- ☆83Updated 2 years ago
- A gradio webui for Andrewyng translation-agent☆29Updated 9 months ago
- Building Blocks for Multi-Modal Gradio Powered by Groq Apps☆111Updated 9 months ago
- Tutorials from AutoGen Basics to Use Cases☆32Updated last year
- 基于通义千问 Qwen2.5-Omni 的实时语音对话系统,使用在线API服务,支持实时语音交互、动态语音活动检测和流式音频处理。A real-time voice conversation system based on Qwen2.5-Omni Online-API, …☆64Updated 3 months ago
- Local Powerpointer - A beautiful powerpoint generator which uses the power of local running large language models to generate the powerpo…☆268Updated 2 months ago
- A function calling tool can be deployed to Cloudflare Workers with openapi schema☆99Updated last year
- Cook up amazing multimodal AI applications effortlessly with MiniCPM-o☆137Updated this week
- Easy to deploy.A cloud service for python code interpreter sandbox for Code-Interpreter.☆54Updated last year
- Multimodal RAG with PyMuPDF☆40Updated 11 months ago
- ☆175Updated last year
- You can play any API server that compatible with OpenAI API☆24Updated last year
- ☆76Updated last year
- Dynamic Voice Actor Assignment and Emotional Narration for Realistic Story Play☆40Updated 5 months ago
- a Dify tool for storing and retrieving long-term-memory, using Dify built-in Knowledge dataset for storing memories, each user has a stan…☆90Updated last year
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆34Updated 6 months ago
- Tracking the hot Github repos and update daily 每天自动追踪Github热门项目☆49Updated this week
- A third-party component library based on Gradio. Integrates Ant Design, Ant Design X, and more advanced components to help you build appl…☆115Updated last week
- "LightRAG: Simple and Fast Retrieval-Augmented Generation"☆58Updated 9 months ago
- 📝 针对文档类图像做内容提取,将文档类图像一比一输出到Word或者Txt中,便于进一步使用或处理。后续计划支持输入PDF/图像,输出对应json格式、Txt格式、Word格式和Markdown格式。☆204Updated 10 months ago
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆28Updated 11 months ago