linto-ai / linto-studioLinks
Transcription and annotation interface for recorded audio or video files
☆41Updated this week
Alternatives and similar repositories for linto-studio
Users that are interested in linto-studio are comparing it to the libraries listed below
Sorting:
- whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++☆71Updated last year
- ez audio transcription tool with flexible processing and post-processing options☆159Updated last year
- 💬 Transcribe, translate, diarize, annotate and subtitle video (and audio) with Whisper on Win, Linux and Mac ... fast!☆63Updated last month
- AI core services for Jitsi☆65Updated last week
- Real-Time Whisper Voice Recognition with vosk model feedback.☆119Updated 2 years ago
- web based editor for subtitles and transcripts☆142Updated last year
- Transcribe audio and video files with speaker diarization and logically grouped timestamps using Gemini Flash☆38Updated 3 weeks ago
- A tool for making videos from PDF presentations.☆31Updated 4 years ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆152Updated 3 weeks ago
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS …☆14Updated 2 months ago
- ☆29Updated last month
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆237Updated 2 months ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆65Updated last year
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆165Updated 2 months ago
- FastAPI service on top of WhisperX☆136Updated last week
- Tool for automatic transcription and speaker diarization based on whisper and pyannote.☆60Updated 8 months ago
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and…☆130Updated 6 months ago
- streaming speech to text server using Whisper☆95Updated 2 years ago
- openduplex uses speech-to-text, artificial intelligence and text-to-speech, to call businesses and make appointments for you☆36Updated 2 years ago
- Speaker diarization service☆24Updated 3 months ago
- Example agents I've built using the LiveKit Agents (https://github.com/livekit/agents) framework☆20Updated last year
- Your customized AI assistant - Personal assistants on any hardware! With llama.cpp, whisper.cpp, ggml, LLaMA-v2.☆118Updated last year
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆80Updated last week
- Add real-time Speech-to-Text to your LiveKit application with AssemblyAI☆17Updated 4 months ago
- An automatic speech recognition API☆70Updated 3 weeks ago
- Effortlessly record, transcribe, and summarize meetings with this user-friendly desktop utility powered by OpenAI's Whisper and GPT-3.5-t…☆187Updated 2 years ago
- Turn any common eBook file into an HQ Audiobook with F5-TTS (Easy Install)☆28Updated 4 months ago
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di…☆304Updated 3 months ago
- Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.☆267Updated last month
- An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper.☆87Updated 8 months ago