linto-ai / linto-studioLinks
Transcription and annotation interface for recorded audio or video files
β39Updated last week
Alternatives and similar repositories for linto-studio
Users that are interested in linto-studio are comparing it to the libraries listed below
Sorting:
- π¬ Transcribe, translate, diarize, annotate and subtitle video (and audio) with Whisper on Win, Linux and Mac ... fast!β57Updated 2 months ago
- AI core services for Jitsiβ62Updated 2 weeks ago
- Transcribe audio and video files with speaker diarization and logically grouped timestamps using Gemini Flashβ34Updated last month
- Add real-time Speech-to-Text to your LiveKit application with AssemblyAIβ17Updated 2 months ago
- ez audio transcription tool with flexible processing and post-processing optionsβ156Updated last year
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.β12Updated 10 months ago
- whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++β68Updated last year
- β28Updated last month
- Real-Time Whisper Voice Recognition with vosk model feedback.β117Updated 2 years ago
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, andβ¦β106Updated 4 months ago
- FastAPI service on top of WhisperXβ120Updated this week
- Tool for automatic transcription and speaker diarization based on whisper and pyannote.β52Updated 6 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts withβ¦β223Updated 4 months ago
- OpenAI Whisper API-style local server, runnig on FastAPIβ83Updated 8 months ago
- An automatic speech recognition APIβ66Updated 3 weeks ago
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS β¦β14Updated 4 months ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.β64Updated last year
- web based editor for subtitles and transcriptsβ137Updated 11 months ago
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into diβ¦β271Updated last month
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.β136Updated 2 weeks ago
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.β66Updated this week
- A tool for making videos from PDF presentations.β28Updated 4 years ago
- streaming speech to text server using Whisperβ94Updated 2 years ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.β53Updated 7 months ago
- Efficient approach to speaker diarization using voice characteristics extractionβ98Updated last month
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning β¦β26Updated 2 years ago
- a simple system for 2-way interruptible voice interactions between human and LLMβ30Updated last year
- Your customized AI assistant - Personal assistants on any hardware! With llama.cpp, whisper.cpp, ggml, LLaMA-v2.β116Updated last year
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.β146Updated 3 weeks ago
- Roomey is a multi-purpose Voice Agent designed to run your personal and business life.β45Updated last month