tdolan21 / openai-whisper-v3-apiLinks
FastAPI + Streamlit interface for OpenAI Whisper-large-v3 with youtube-to-mp3
☆27Updated 2 years ago
Alternatives and similar repositories for openai-whisper-v3-api
Users that are interested in openai-whisper-v3-api are comparing it to the libraries listed below
Sorting:
- Have a natural voice conversation with an LLM☆262Updated 3 months ago
- Tutorials from AutoGen Basics to Use Cases☆33Updated 2 years ago
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆86Updated 2 weeks ago
- ☆85Updated 2 years ago
- Tracking the hot Github repos and update daily 每天自动追踪Github热门项目☆50Updated this week
- Real time faster whisper gradio☆25Updated 4 months ago
- You can play any API server that compatible with OpenAI API☆24Updated last year
- ☆121Updated 2 years ago
- ☆175Updated 2 years ago
- The latest graphrag interface is used, using the local ollama to provide the LLM interface.Support for using the pip installation☆155Updated last year
- 基于通义千问 Qwen2.5-Omni 的实时语音对话系统,使用在线API服务,支持实时语音交互、动态语音活动检测和流式音频处理。A real-time voice conversation system based on Qwen2.5-Omni Online-API, …☆83Updated 8 months ago
- ☆198Updated last year
- Python implementation of AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, w…☆49Updated 9 months ago
- This project fixes the Wav2Lip project so that it can run on Python 3.9. Wav2Lip is a project that can be used to lip-sync videos to audi…☆17Updated 2 years ago
- Open Source Text Embedding Models with OpenAI Compatible API☆165Updated last year
- "LightRAG: Simple and Fast Retrieval-Augmented Generation"☆59Updated last year
- ☆57Updated last year
- ☆76Updated last year
- 首个llama2 13b 中文版模型 (Base + 中文对话SFT,实现流畅多轮人机自然语言交互)☆91Updated 2 years ago
- Local Powerpointer - A beautiful powerpoint generator which uses the power of local running large language models to generate the powerpo…☆283Updated 6 months ago
- ☆151Updated last year
- bisheng-unstructured library☆57Updated 7 months ago
- ☆74Updated last year
- ☆102Updated last year
- gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR、TTS、文生图、图片编辑和文生视频的开源框架。☆243Updated 2 weeks ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆33Updated 11 months ago
- Multi-Agents & Plugins repo for DB-GPT, Can complete various tasks around databases.☆103Updated last year
- ☆252Updated 2 years ago
- Multimodal RAG with PyMuPDF☆43Updated last year
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆17Updated last year