linto-ai / linto-studioLinks
Transcription and annotation interface for recorded audio or video files
β40Updated this week
Alternatives and similar repositories for linto-studio
Users that are interested in linto-studio are comparing it to the libraries listed below
Sorting:
- π¬ Transcribe, translate, diarize, annotate and subtitle video (and audio) with Whisper on Win, Linux and Mac ... fast!β63Updated last week
- Transcribe audio and video files with speaker diarization and logically grouped timestamps using Gemini Flashβ37Updated last week
- ez audio transcription tool with flexible processing and post-processing optionsβ158Updated last year
- AI core services for Jitsiβ64Updated this week
- FastAPI service on top of WhisperXβ129Updated this week
- Real-Time Whisper Voice Recognition with vosk model feedback.β119Updated 2 years ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.β64Updated last year
- whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++β70Updated last year
- Tool for automatic transcription and speaker diarization based on whisper and pyannote.β57Updated 8 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts withβ¦β234Updated last month
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into diβ¦β291Updated 2 months ago
- web based editor for subtitles and transcriptsβ141Updated last year
- A tool for making videos from PDF presentations.β31Updated 4 years ago
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS β¦β14Updated last month
- β40Updated last week
- A simple TTS server for generating speech using StyleTTS2β37Updated last year
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.β12Updated 11 months ago
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.β68Updated 11 months ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning β¦β26Updated 2 years ago
- streaming speech to text server using Whisperβ93Updated 2 years ago
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, andβ¦β118Updated 5 months ago
- Effortlessly record, transcribe, and summarize meetings with this user-friendly desktop utility powered by OpenAI's Whisper and GPT-3.5-tβ¦β188Updated 2 years ago
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.β161Updated 2 months ago
- Speaker diarization serviceβ23Updated 3 months ago
- An automatic speech recognition APIβ68Updated last month
- Add real-time Speech-to-Text to your LiveKit application with AssemblyAIβ17Updated 3 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.β54Updated 9 months ago
- Efficient approach to speaker diarization using voice characteristics extractionβ100Updated 3 months ago
- Open models for Coqui STTβ142Updated 2 years ago
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime portβ¦β23Updated 3 weeks ago