badgids / transcription-app
a transcription application that listens to audio input from the microphone using OpenAI's Whisper, transcribes it into text, and simulates typing the transcription in real-time wherever your cursor is on the screen. It can also do realtime translation.
☆21Updated last year
Alternatives and similar repositories for transcription-app:
Users that are interested in transcription-app are comparing it to the libraries listed below
- Desktop application for Linux and Windows that utilizes distil-whisper models from HuggingFace, to enable real-time offline speech-to-tex…☆58Updated 3 months ago
- Okra, your all in one personal AI assistant☆14Updated 10 months ago
- ☆48Updated 6 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆53Updated 4 months ago
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆54Updated 8 months ago
- ☆64Updated last month
- ☆71Updated last year
- ☆18Updated 8 months ago
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes s…☆52Updated last year
- A TTS extension for oobabooga text WebUI☆31Updated last year
- An AI Discord bot that connects to a koboldcpp instance by API calls. Have a more intelligent Clyde Bot of your own making!☆37Updated 6 months ago
- ☆17Updated 4 months ago
- Seamless Voice Interactions with LLMs☆12Updated last year
- Discord chatbot interface to train an LLM on user message history☆27Updated last year
- Unofficial package to easily interact with the Kits.AI API☆10Updated last year
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆30Updated this week
- A very simple implementation of edge_tts w/ RVC for oobabooga text-generation-webui.☆41Updated last year
- ☆40Updated last year
- Browser extension that lets you summarize and chat with any webpage using a local LLM of your choice.☆20Updated 6 months ago
- Performs the entire AI cover generation process with UI☆17Updated last week
- LIVA - Local Intelligent Voice Assistant☆61Updated 8 months ago
- A Lightweight Gradio Web interface for Text-to-Audio Generation utilising SAO1.0☆51Updated 10 months ago
- Real‑time, low‑latency voice, vision, and conversational‑memory AI assistant built on LiveKit and local LLMs☆23Updated 2 weeks ago
- Fetch related searches for a given query from Google Trends, and for each related search, it retrieves the date it was most popular and i…☆12Updated last month
- Gradio based tool to run opensource LLM models directly from Huggingface☆91Updated 10 months ago
- ☆91Updated 3 months ago
- Run AuraFlow on Replicate☆14Updated 9 months ago
- Local & private voice controlled notepad using whisper.cpp☆24Updated last year
- 🔊 Text2Speech, Voice-Cloning and Voice2Voice conversion with the text-prompted generative audio model bark☆62Updated last month
- Generate visual podcasts about novels using open source models☆25Updated 2 years ago