tdolan21 / openai-whisper-v3-api
FastAPI + Streamlit interface for OpenAI Whisper-large-v3 with youtube-to-mp3
☆26Updated last year
Alternatives and similar repositories for openai-whisper-v3-api
Users who are interested in openai-whisper-v3-api are comparing it to the libraries listed below.
- Have a natural voice conversation with an LLM☆259Updated last month
- An open source chat bot architecture for voice/vision (and multimodal) assistants that can run locally (CPU/GPU bound) or remotely (I/O bound).☆84Updated last week
- You can play with any API server that is compatible with the OpenAI API☆24Updated last year
- Uses the latest graphrag interface, with a local ollama instance providing the LLM interface. Supports installation via pip.☆155Updated last year
- ☆120Updated last year
- Automatically tracks hot GitHub repos and updates daily☆49Updated this week
- Tutorials from AutoGen Basics to Use Cases☆32Updated last year
- ☆197Updated last year
- Local Powerpointer - A beautiful PowerPoint generator which uses the power of locally running large language models to generate the powerpo…☆277Updated 4 months ago
- ☆85Updated 2 years ago
- Real time faster whisper gradio☆26Updated 2 months ago
- A real-time voice conversation system based on Tongyi Qianwen Qwen2.5-Omni that uses the online API service and supports real-time voice interaction, dynamic voice activity detection, and streaming audio processing.☆78Updated 5 months ago
- A Gradio web UI for andrewyng's translation-agent☆30Updated 11 months ago
- An enterprise-grade AI retriever designed to streamline AI integration into your applications, ensuring cutting-edge accuracy.☆291Updated 4 months ago
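- 📝 Extracts content from document images and outputs it one-to-one to Word or Txt for further use or processing. Planned future support: PDF/image input, with output in the corresponding JSON, Txt, Word, and Markdown formats.☆207Updated last year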
- ☆113Updated last year
- ☆238Updated 5 months ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆34Updated 9 months ago
- ASR + diarization model server with speculative decoding☆63Updated last year
- Code implementation repository for the paper HiQA☆103Updated 8 months ago
- Open Source Text Embedding Models with OpenAI Compatible API☆160Updated last year
- Multimodal RAG with PyMuPDF☆41Updated last year
- ☆57Updated last year
- ☆145Updated last year
- "LightRAG: Simple and Fast Retrieval-Augmented Generation"☆58Updated 11 months ago
- SmolDocling OCR App built using SmolDocling 256M Model and Streamlit.☆168Updated 7 months ago
- 🌟 Revolutionize Your Operations with One Sentence Automation: Utilizing large language models and Multi-Agents to generate operational c…☆56Updated 2 years ago
- ☆74Updated last year
- ☆251Updated last year
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆139Updated last year