linto-ai / linto-studio
Transcription and annotation interface for recorded audio or video files
☆31 · Updated this week
Alternatives and similar repositories for linto-studio:
Users who are interested in linto-studio are comparing it to the repositories listed below.
- ez audio transcription tool with flexible processing and post-processing options ☆148 · Updated last year
- Speaker diarization service ☆21 · Updated last month
- streaming speech to text server using Whisper ☆89 · Updated last year
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription. ☆11 · Updated 6 months ago
- 💬 Transcribe, translate, diarize, annotate and subtitle video (and audio) with Whisper on Win, Linux and Mac ... fast! ☆38 · Updated this week
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and… ☆60 · Updated this week
- web based editor for subtitles and transcripts ☆127 · Updated 7 months ago
- whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++ ☆61 · Updated 10 months ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google. ☆62 · Updated last year
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di… ☆192 · Updated last month
- zero-shot realtime TTS system, fully offline, free and open source ☆32 · Updated this week
- Uses machine learning to denoise audio containing speech ☆32 · Updated 9 months ago
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local (CPU/GPU bound) and remote (I/O bound) to run. ☆33 · Updated this week
- Accelerate Whisper tasks such as transcription by multiprocessing through parallelization ☆25 · Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper ☆25 · Updated 8 months ago
- Transcribe audio and video files with speaker diarization and logically grouped timestamps ☆12 · Updated 3 weeks ago
- openduplex uses speech-to-text, artificial intelligence and text-to-speech, to call businesses and make appointments for you ☆31 · Updated last year
- AI core services for Jitsi ☆53 · Updated this week
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech… ☆17 · Updated 2 years ago
- Toolkit for training/converting LibreTranslate compatible language models 🚂 ☆51 · Updated this week
- SEPIA server to support open-source speech recognition via WebSocket connection. ☆123 · Updated 4 months ago
- An automatic speech recognition API ☆55 · Updated last week
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port… ☆16 · Updated 7 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆61 · Updated 3 weeks ago
- Robust Speech Recognition via Large-Scale Weak Supervision ☆30 · Updated last year
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker … ☆20 · Updated 6 months ago
- Automatically generate a lip-synced avatar based off of a transcript and audio ☆14 · Updated 2 years ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution. ☆53 · Updated 3 months ago
- Generate video stories with AI ✨ ☆32 · Updated 7 months ago
- An even smaller speech recognizer / force aligner ☆32 · Updated 3 months ago