badgids / transcription-app
a transcription application that listens to audio input from the microphone using OpenAI's Whisper, transcribes it into text, and simulates typing the transcription in real-time wherever your cursor is on the screen. It can also do realtime translation.
☆13Updated 6 months ago
Related projects: ⓘ
- ☆38Updated this week
- A TTS extension for oobabooga text WebUI☆26Updated 4 months ago
- Efficient approach to speaker diarization using voice characteristics extraction☆56Updated 4 months ago
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆43Updated last month
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆29Updated last month
- A very simple implementation of edge_tts w/ RVC for oobabooga text-generation-webui.☆40Updated 7 months ago
- Transcribe with ease :D☆13Updated last year
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆63Updated last year
- ☆74Updated 2 months ago
- ☆66Updated 6 months ago
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes s…☆47Updated 4 months ago
- A simple extension that uses Bark Text-to-Speech for audio output☆35Updated 10 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆81Updated 4 months ago
- Discord chatbot interface to train an LLM on user message history☆27Updated last year
- Site for sharing Bark voices☆47Updated 2 months ago
- API server for Instant voice cloning by MyShell.☆59Updated 4 months ago
- ☆44Updated this week
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆119Updated 2 months ago
- ☆62Updated 4 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆56Updated last month
- RAG implementation for Ooba characters. dynamically spins up new qdrant vector DB and manages retrieval and commits for conversations ba…☆45Updated 11 months ago
- ☆56Updated 3 weeks ago
- Dynamic parameter modulation for oobabooga's text-generation-webui that adjusts generation parameters to better mirror user affect.☆32Updated last year
- Diffusion_TTS extension for booga☆59Updated 2 months ago
- Llama cute voice assistant☆28Updated last year
- A Lightweight Gradio Web interface for Text-to-Audio Generation utilising SAO1.0☆39Updated 3 months ago
- On-device streaming text-to-speech engine powered by deep learning☆43Updated last week
- Windows-compatible Fast API implementation of VoiceCraft, the Zero-Shot Speech Editing and Text-to-Speech in the Wild☆17Updated 4 months ago
- A local AI companion that uses a collection of free, open source AI models in order to create two virtual companions that will follow you…☆67Updated 3 weeks ago
- XTTSv2 Extension for oobabooga text-generation-webui☆33Updated 2 months ago