Haschtl / transcripy
Multi speaker audio transcription
☆44 Updated 3 years ago
Alternatives and similar repositories for transcripy
Users that are interested in transcripy are comparing it to the libraries listed below
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with… ☆250 Updated 5 months ago
- LipSyncr is a lip reading web app based on the LipNet model that can lip read videos. ☆77 Updated 2 years ago
- On-device speaker diarization powered by deep learning ☆66 Updated 2 weeks ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution. ☆53 Updated last year
- Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM. ☆287 Updated last month
- An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper. ☆90 Updated last year
- Conversational Speaker Diarization using OpenAI AI Language Models (gpt-4) and OpenAI Whisper. ☆14 Updated 2 years ago
- A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection ☆118 Updated last year
- Roomey is a multi-purpose Voice Agent designed to run your personal and business life. ☆60 Updated 7 months ago
- Second attempt at AI webcam, this time with OpenAI API ☆40 Updated 2 years ago
- streaming speech to text server using Whisper ☆101 Updated 2 years ago
- Add real-time Speech-to-Text to your LiveKit application with AssemblyAI ☆18 Updated 8 months ago
- 🎙️ Speak with AI - Run locally using Ollama, OpenAI, Anthropic or xAI - Speech uses SparkTTS, OpenAI, ElevenLabs or Kokoro ☆386 Updated last week
- Transcription and annotation interface for recorded audio or video files ☆51 Updated last week
- ☆18 Updated 2 years ago
- 🎤📄 An innovative tool that transforms audio or video files into text transcripts and generates concise meeting minutes. Stay organized … ☆161 Updated last month
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆100 Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback. ☆121 Updated 2 years ago
- Adapting Vercel's AI chatbot to use LiveKit as the transport ☆21 Updated 10 months ago
- Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech ☆130 Updated 2 years ago
- Podalize: Podcast Transcription and Analysis ☆158 Updated last year
- Self-hosted AI voice agent ☆125 Updated last year
- An open-source, browser-based transcript viewer and manager. Upload, transcribe, and chat with meeting recordings using AI. Features meet… ☆65 Updated 9 months ago
- Record audio or transcribe files using ctranslate2 and whisper! ☆170 Updated this week
- On-device speaker recognition engine powered by deep learning ☆40 Updated 2 weeks ago
- AI at your fingertips: powerful CLI tools for speech, text, and language processing ☆22 Updated last year
- Python app for LM Studio-enhanced voice conversations with local LLMs. Uses Whisper for speech-to-text and offers a privacy-focused, acce… ☆134 Updated last year
- ezlocalai is an easy to set up local artificial intelligence server with OpenAI Style Endpoints. ☆91 Updated last week
- TypeScript-based library for real-time audio transcription, integrating OpenAI's Whisper model for accurate speech-to-text conversion. ☆72 Updated 2 years ago
- Agent Studio is an AI agent application designed to handle real-time interactions through phone calls, web-based voice user interfaces (V… ☆48 Updated last year