badgids / transcription-app
a transcription application that listens to audio input from the microphone using OpenAI's Whisper, transcribes it into text, and simulates typing the transcription in real-time wherever your cursor is on the screen. It can also do realtime translation.
☆18Updated 11 months ago
Alternatives and similar repositories for transcription-app:
Users that are interested in transcription-app are comparing it to the libraries listed below
- Convert your PDFs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient…☆44Updated last week
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆92Updated 9 months ago
- Desktop application for Linux and Windows that utilizes distil-whisper models from HuggingFace, to enable real-time offline speech-to-tex…☆55Updated last month
- A simple extension that uses Bark Text-to-Speech for audio output☆35Updated last year
- 🔊 Text2Speech, Voice-Cloning and Voice2Voice conversion with the text-prompted generative audio model bark☆57Updated this week
- A TTS extension for oobabooga text WebUI☆29Updated 9 months ago
- ☆43Updated 3 months ago
- A very simple implementation of edge_tts w/ RVC for oobabooga text-generation-webui.☆42Updated last year
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆65Updated last year
- Automated LLM novelist☆42Updated 10 months ago
- ☆69Updated 11 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆40Updated this week
- Create an LJSpeech structured voice dataset on wave input☆26Updated 4 months ago
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes s…☆52Updated 9 months ago
- ☆9Updated last year
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆48Updated 6 months ago
- Gradio based tool to run opensource LLM models directly from Huggingface☆90Updated 7 months ago
- Simulates talk with an AI that can express emotions☆54Updated 6 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆49Updated 2 months ago
- ☆40Updated 10 months ago
- An AI Discord bot that connects to a koboldcpp instance by API calls. Have a more intelligent Clyde Bot of your own making!☆34Updated 4 months ago
- Transcribe with ease :D☆14Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆88Updated 9 months ago
- An Extension for oobabooga/text-generation-webui☆36Updated last year
- Discord chatbot interface to train an LLM on user message history☆27Updated last year
- Viral Factory is a highly modular gradio app that automates the production of various forms of social media content. Thanks to it's comp…☆43Updated 2 months ago
- Based on kylemcdonald/i2i-realtime. The warping server for GenDJ real time webcam AI warping☆27Updated 7 months ago
- Real time audio to audio translation over sockets. With virtual microphones, you can use this in any video conferencing software you'd li…☆27Updated 6 months ago
- Garvis: Realtime AI Voice Assistant☆36Updated 8 months ago
- ☆58Updated 5 months ago