linto-ai / linto-studio
Transcription and annotation interface for recorded audio or video files
☆24, updated this week
Related projects:
- ez audio transcription tool with flexible processing and post-processing options (☆122, updated 7 months ago)
- AI core services for Jitsi (☆24, updated this week)
- streaming speech to text server using Whisper (☆75, updated last year)
- ☆32, updated last year
- web based editor for subtitles and transcripts (☆102, updated last month)
- a transcription application that listens to audio input from the microphone using OpenAI's Whisper, transcribes it into text, and simulat… (☆13, updated 6 months ago)
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles (☆73, updated last year)
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines (☆81, updated 4 months ago)
- An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper. (☆28, updated 6 months ago)
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google. (☆51, updated 8 months ago)
- Synchronize Whisper's timestamps over an existing accurate transcription (☆124, updated 3 months ago)
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper. (☆55, updated this week)
- Real-Time Whisper Voice Recognition with vosk model feedback. (☆103, updated last year)
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with… (☆134, updated 3 weeks ago)
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces. (☆84, updated this week)
- openduplex uses speech-to-text, artificial intelligence and text-to-speech, to call businesses and make appointments for you (☆24, updated last year)
- SEPIA server to support open-source speech recognition via WebSocket connection. (☆120, updated last year)
- Fast! Offline, privacy-focused, hands-free voice typing, 2-way AI voice chat, AI images, webcam, recorder, voice control, in under 4 GiB … (☆157, updated this week)
- Ìpàdé (Yoruba word for Meeting) is a web-serverless version of Openfire Meetings hosted on GitPages. (☆21, updated last year)
- Toolkit for training/converting LibreTranslate compatible language models 🚂 (☆42, updated 3 months ago)
- Code for OpenAI Whisper Web App Demo (☆95, updated last year)
- A testing repo to share code and thoughts on diarisation (☆50, updated 5 months ago)
- Efficient approach to speaker diarization using voice characteristics extraction (☆56, updated 4 months ago)
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization. (☆188, updated 2 months ago)
- Using a single image and just 10 seconds of sample audio, our project enables you to create a video where it appears as if you're speakin… (☆26, updated last year)
- Public voice datasets used for our Text-to-Speech voices. (☆25, updated last month)
- Offline voice input panel & keyboard with punctuation for Android. (☆84, updated 3 months ago)
- A tool for making videos from PDF presentations. (☆21, updated 3 years ago)
- A curated list of awesome OpenAI's Whisper (☆91, updated last year)
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and … (☆147, updated 3 weeks ago)