badgids / transcription-app
a transcription application that listens to audio input from the microphone using OpenAI's Whisper, transcribes it into text, and simulates typing the transcription in real-time wherever your cursor is on the screen. It can also do realtime translation.
☆21Updated last year
Alternatives and similar repositories for transcription-app:
Users that are interested in transcription-app are comparing it to the libraries listed below
- Desktop application for Linux and Windows that utilizes distil-whisper models from HuggingFace, to enable real-time offline speech-to-tex…☆57Updated 3 months ago
- Using LLMs and rules for a local personal agent☆16Updated 3 months ago
- Okra, your all in one personal AI assistant☆14Updated 10 months ago
- An AI Discord bot that connects to a koboldcpp instance by API calls. Have a more intelligent Clyde Bot of your own making!☆37Updated 6 months ago
- time based thinking and structure like OpenAI's o1 preview.☆10Updated 7 months ago
- Fetch related searches for a given query from Google Trends, and for each related search, it retrieves the date it was most popular and i…☆13Updated last month
- AURORA (Artificial Unified Responsive Optimized Reasoning Agent) uses lobes and web research for RAG based memory and learning.☆17Updated 5 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated 11 months ago
- A powerful AI content generation tool that leverages GPT-4 and LangChain to automatically create SEO-optimized blog posts and structured …☆24Updated 2 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆53Updated 4 months ago
- Create storybooks using CrewAI, Groq, and Ollama☆20Updated last year
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆48Updated last week
- AI Search engine☆12Updated 2 months ago
- Generate visual podcasts about novels using open source models☆25Updated 2 years ago
- 🐜🔧 A minimalistic tool to fine-tune your LLMs☆18Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆91Updated last year
- ☆40Updated last year
- Seamless Voice Interactions with LLMs☆12Updated last year
- 💬 Transcribe, translate, diarize, annotate and subtitle video (and audio) with Whisper on Win, Linux and Mac ... fast!☆42Updated 3 weeks ago
- ☆63Updated last month
- ☆27Updated last year
- ☆9Updated last year
- A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) within reach of everyone, particu…☆34Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆62Updated 2 weeks ago
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes s…☆52Updated last year
- Tool for automatic transcription and speaker diarization based on whisper and pyannote.☆44Updated 3 months ago
- 🔊 Text2Speech, Voice-Cloning and Voice2Voice conversion with the text-prompted generative audio model bark☆61Updated 3 weeks ago
- Powered by OpenAI Whisper & Gradio☆30Updated 2 years ago
- A very simple implementation of edge_tts w/ RVC for oobabooga text-generation-webui.☆41Updated last year
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆107Updated 2 months ago