joshuaboniface / remote-faster-whisper
A basic HTTP API for handling Faster Whisper audio transcriptions over the network
☆28 Updated 5 months ago
Alternatives and similar repositories for remote-faster-whisper:
Users who are interested in remote-faster-whisper are comparing it to the libraries listed below.
- OpenAI Whisper API-style local server, running on FastAPI ☆77 Updated 4 months ago
- Docker images and configuration to run text-generation-webui with GPU or CPU support ☆29 Updated last year
- Easily create LLM automation/agent workflows ☆59 Updated last year
- SPLAA is an AI assistant framework that utilizes voice recognition, text-to-speech, and tool-calling capabilities to provide a conversati… ☆27 Updated 3 months ago
- Docker configuration for koboldcpp ☆34 Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution. ☆53 Updated 4 months ago
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and… ☆65 Updated 3 weeks ago
- LLM Chat is an open-source serverless alternative to ChatGPT. ☆33 Updated 7 months ago
- Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes. ☆38 Updated last year
- ☆24 Updated 3 months ago
- My version of an LLM Websearch Agent using a local SearXNG server because SearXNG is great. ☆30 Updated last month
- An OpenAI API compatible images server to generate or manipulate images. ☆16 Updated 2 months ago
- This project aims to combine the latest LLMs, Multi-Step Asynchronous Function Calling, Natural Language Processing, and Text-to-Speech. ☆37 Updated last year
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script) ☆52 Updated 6 months ago
- Mobile web app for audio "push-to-talk" + TTS chat interface with OpenAI-like APIs ☆43 Updated last year
- V.I.S.O.R., my in-development AI-powered voice assistant with integrated memory! ☆35 Updated this week
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper. ☆107 Updated 2 months ago
- ☆28 Updated 6 months ago
- A third-party package manager for OpenWebUI ☆29 Updated 9 months ago
- Run Ollama LLM models in Google Colab for free ☆33 Updated 5 months ago
- Augment AI agents with long-term memory through knowledge graph 🧠 ☆64 Updated 6 months ago
- Whisperx API implementation ☆27 Updated 11 months ago
- API server for Instant voice cloning by MyShell. ☆89 Updated 7 months ago
- Simulates talk with an AI that can express emotions ☆65 Updated 9 months ago
- Python app for LM Studio-enhanced voice conversations with local LLMs. Uses Whisper for speech-to-text and offers a privacy-focused, acce… ☆92 Updated 11 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around… ☆54 Updated 8 months ago
- Bookmarklet to pull and run hugging face GGUF models in Ollama ☆14 Updated 6 months ago
- Polyglot is a fast, elegant, and free translation tool using AI. ☆60 Updated 7 months ago
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G… ☆23 Updated 3 weeks ago
- ☆22 Updated 8 months ago