badgids / transcription-app
A transcription application that listens to audio input from the microphone, transcribes it into text using OpenAI's Whisper, and simulates typing the transcription in real time wherever your cursor is on the screen. It can also do real-time translation.
☆21Updated last year
Alternatives and similar repositories for transcription-app
Users who are interested in transcription-app are comparing it to the repositories listed below.
- Desktop application for Linux and Windows that utilizes distil-whisper models from HuggingFace, to enable real-time offline speech-to-tex…☆58Updated 4 months ago
- An AI Discord bot that connects to a koboldcpp instance by API calls. Have a more intelligent Clyde Bot of your own making!☆39Updated 7 months ago
- Okra, your all in one personal AI assistant☆14Updated 11 months ago
- Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes.☆38Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆34Updated 10 months ago
- ☆17Updated 5 months ago
- Seamless Voice Interactions with LLMs☆12Updated last year
- Time-based thinking and structure, like OpenAI's o1-preview.☆10Updated 8 months ago
- An API for VoiceCraft.☆25Updated 11 months ago
- 🗣️ Real‑time, low‑latency voice, vision, and conversational‑memory AI assistant built on LiveKit and local LLMs ✨☆35Updated last week
- Terminal Voice Assistant is a powerful and flexible tool designed to help users interact with their terminal using natural language comma…☆18Updated 11 months ago
- ☆50Updated 6 months ago
- A very simple implementation of edge_tts w/ RVC for oobabooga text-generation-webui.☆41Updated last year
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆79Updated 7 months ago
- 100% Local Document deep search with LLMs☆26Updated 9 months ago
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆23Updated 2 months ago
- Interact with an AI game engine that keeps building its rules and world as you play, adapted to your gameplay.☆45Updated last week
- Generate visual podcasts about novels using open source models☆25Updated 2 years ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆54Updated 9 months ago
- B-Llama3o: a Llama 3 with vision and audio understanding, as well as text, audio, and animation data output.☆26Updated last year
- ☆91Updated 3 weeks ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆52Updated 5 months ago
- Garvis: Realtime AI Voice Assistant☆38Updated last year
- ☆72Updated last year
- ☆22Updated 4 months ago
- LLM-backed fantasy tribe game☆18Updated 6 months ago
- ☆16Updated last week
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆56Updated last month
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆22Updated 11 months ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Updated 6 months ago