heimoshuiyu / whisper-fastapiLinks
A very simple whsper Python FastAPI for OpenAI API, Android voice-typing (konele), Home Assistant (wyoming), and a voice-typing script on Linux and MacOS!
☆37Updated 6 months ago
Alternatives and similar repositories for whisper-fastapi
Users that are interested in whisper-fastapi are comparing it to the libraries listed below
Sorting:
- Real time faster whisper gradio☆26Updated 2 months ago
 - OpenAI Whisper API-style local server, runnig on FastAPI☆86Updated 3 weeks ago
 - Cross Platform Open Sourced Chinese NoteBookLM app based on Electron, Use DeepSeek + Reecho.ai☆80Updated 11 months ago
 - Have a natural voice conversation with an LLM☆260Updated 3 weeks ago
 - Streaming ASR and TTS based on FastAPI+ sherpa-onnx☆156Updated this week
 - The latest graphrag interface is used, using the local ollama to provide the LLM interface.Support for using the pip installation☆155Updated last year
 - Speech Diarization for scrum automation☆111Updated 2 years ago
 - A gradio webui for Andrewyng translation-agent☆30Updated 10 months ago
 - Use LLM (ollama, QWEN, ChatGPT) to translate the pdf inplacely☆58Updated 5 months ago
 - Get up and running with Llama 3, Mistral, Gemma, and other large language models.☆30Updated last month
 - A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆169Updated 3 months ago
 - TEaR framework for paper "TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement"☆47Updated last year
 - OpenAI API and Whisper based Video Translation☆74Updated 10 months ago
 - ☆112Updated last year
 - Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper.☆134Updated 2 months ago
 - Conversational Retrieval Evaluation Dataset☆101Updated 2 months ago
 - SmolDocling OCR App built using SmolDocling 256M Model and Streamlit.☆168Updated 7 months ago
 - g1: Using GPT-4o to create o1-like reasoning chains☆20Updated last year
 - AI tool for auto-research, TTS, and Graphical assembly into a completed Podcast☆81Updated 3 months ago
 - A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.☆127Updated last week
 - Using GPT to parse PDF☆101Updated last year
 - This repository contains a web application designed to execute relatively compact, locally-operated Large Language Models (LLMs).☆43Updated 7 months ago
 - 一个LightRAG的API模拟器,用于在Openwebui中通过自带的Ollama接口使用LightRAG;通过对话时使用前缀,还可以实现lightrag的模式切换。☆25Updated 11 months ago
 - ☆93Updated 3 months ago
 - Pipeline for converting PDFs to raw text with PaddleOCR☆23Updated 2 years ago
 - An JS web client for connecting to Pipecat bots with voice and vision☆45Updated 10 months ago
 - ☆174Updated last year
 - WIP. Apps (100+) + AI.☆30Updated last year
 - An agentic workflow for story book generation☆31Updated 7 months ago
 - Docker compose to run vLLM on Windows☆104Updated last year