heimoshuiyu / whisper-fastapiLinks
A very simple whsper Python FastAPI for OpenAI API, Android voice-typing (konele), Home Assistant (wyoming), and a voice-typing script on Linux and MacOS!
☆31Updated 4 months ago
Alternatives and similar repositories for whisper-fastapi
Users that are interested in whisper-fastapi are comparing it to the libraries listed below
Sorting:
- Real time faster whisper gradio☆26Updated last month
- FastAPI service on top of WhisperX☆129Updated this week
- Streaming ASR and TTS based on FastAPI+ sherpa-onnx☆151Updated 5 months ago
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆161Updated 2 months ago
- Cross Platform Open Sourced Chinese NoteBookLM app based on Electron, Use DeepSeek + Reecho.ai☆78Updated 10 months ago
- Have a natural voice conversation with an LLM☆256Updated 9 months ago
- AI tool for auto-research, TTS, and Graphical assembly into a completed Podcast☆79Updated 2 months ago
- OpenAI Whisper API-style local server, runnig on FastAPI☆84Updated 9 months ago
- 🎧 Pod-Helper: Real-time audio transcription and repair on consumer hardware☆77Updated last year
- A gradio webui for Andrewyng translation-agent☆30Updated 9 months ago
- Multimodal RAG with PyMuPDF☆40Updated 11 months ago
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆74Updated this week
- SmolDocling OCR App built using SmolDocling 256M Model and Streamlit.☆160Updated 5 months ago
- WIP. Apps (100+) + AI.☆30Updated last year
- Get up and running with Llama 3, Mistral, Gemma, and other large language models.☆30Updated this week
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.☆125Updated this week
- Using GPT to parse PDF☆101Updated last year
- Speech Diarization for scrum automation☆111Updated 2 years ago
- Use LLM (ollama, QWEN, ChatGPT) to translate the pdf inplacely☆52Updated 4 months ago
- DeepSearch Code-Actions Agent (DSCA). Build 🙌 with 🤗 smolagents☆117Updated last month
- ☆92Updated 2 months ago
- The latest graphrag interface is used, using the local ollama to provide the LLM interface.Support for using the pip installation☆156Updated 11 months ago
- Datalore is an AI-powered Data Analysis tool that integrates Anthropic's Claude API with various data analysis libraries and custom funct…☆42Updated 6 months ago
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper.☆129Updated 2 weeks ago
- 🧠 世界上覆盖最全的优秀Qwen提示语大全,欢迎贡献你的提示词。🧠 The most comprehensive collection of excellent Qwen prompts in the world. Feel free to contribute you…☆269Updated last month
- Auto Thinking Mode switch for Qwen3 in Open webui☆68Updated 4 months ago
- ☆53Updated 9 months ago
- WhisperX Service love docker!☆16Updated last year
- 02. Enabling various applications to be AI-enabled or used by AI.☆29Updated last year
- Open Sourced NoteBookLM☆59Updated 11 months ago