badgids / transcription-appLinks
a transcription application that listens to audio input from the microphone using OpenAI's Whisper, transcribes it into text, and simulates typing the transcription in real-time wherever your cursor is on the screen. It can also do realtime translation.
☆21Updated last year
Alternatives and similar repositories for transcription-app
Users that are interested in transcription-app are comparing it to the libraries listed below
Sorting:
- Desktop application for Linux and Windows that utilizes distil-whisper models from HuggingFace, to enable real-time offline speech-to-tex…☆58Updated 5 months ago
- An AI Discord bot that connects to a koboldcpp instance by API calls. Have a more intelligent Clyde Bot of your own making!☆39Updated 8 months ago
- Create storybooks using CrewAI, Groq, and Ollama☆21Updated last year
- Okra, your all in one personal AI assistant☆14Updated last year
- time based thinking and structure like OpenAI's o1 preview.☆10Updated 9 months ago
- Using LLMs and rules for a local personal agent☆16Updated 5 months ago
- A very simple implementation of edge_tts w/ RVC for oobabooga text-generation-webui.☆41Updated last year
- LLM Chat is an open-source serverless alternative to ChatGPT.☆34Updated 9 months ago
- Simulates talk with an AI that can express emotions☆71Updated last week
- Real time conversatio co-pilot able to generate suggestions from recorded audio☆13Updated last year
- Transcribe with ease :D☆15Updated 2 years ago
- Terminal Voice Assistant is a powerful and flexible tool designed to help users interact with their terminal using natural language comma…☆18Updated last year
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆56Updated 10 months ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆26Updated 2 weeks ago
- ☆50Updated 7 months ago
- Discord chatbot interface to train an LLM on user message history☆27Updated 2 years ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆54Updated 10 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆63Updated last week
- My version of an LLM Websearch Agent using a local SearXNG server because SearXNG is great.☆36Updated 3 months ago
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆23Updated 2 months ago
- 🔊 Text2Speech, Voice-Cloning and Voice2Voice conversion with the text-prompted generative audio model bark☆65Updated last week
- AURORA (Artificial Unified Responsive Optimized Reasoning Agent) uses lobes and web research for RAG based memory and learning.☆17Updated 7 months ago
- All Algorithms implemented in Python☆16Updated 3 months ago
- llmon-py is a multimodal webui for Llama 3-8B.☆16Updated 11 months ago
- ☆67Updated 3 months ago
- FastAPI service on top of WhisperX☆109Updated last week
- Mycomind Daemon: A mycelium-inspired, advanced Mixture-of-Memory-RAG-Agents (MoMRA) cognitive assistant that combines multiple AI models …☆34Updated 11 months ago
- Create a Scale-able Full Stack Education Platform with React-Tailwind, MongoDB & Nodejs☆12Updated last month
- Browser extension that lets you summarize and chat with any webpage using a local LLM of your choice.☆22Updated 8 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆95Updated last year