marktnoonan / transcription
Live Transcription based on Speech Recognition API
☆35Updated last year
Related projects ⓘ
Alternatives and complementary repositories for transcription
- A lightweight transcript editor for editing and correcting STT generated timed transcripts☆37Updated last week
- Simple GUI application to help record audio dictated from given text prompts, for use with training speech recognition or speech synthesi…☆40Updated 3 years ago
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level.☆25Updated last year
- Hyperaudio Lite - a Super-lightweight Interactive Transcript Player☆127Updated this week
- Open models for Coqui STT☆122Updated last year
- ☆13Updated last year
- A VoiceAsistant with WhisperAI speech recognition☆29Updated 2 weeks ago
- Voice models for Mimic 3 text to speech system☆131Updated 5 months ago
- Educational player with phrasal playback and parallel multi-language subtitles. Online subtitles/captions editor.☆19Updated 9 months ago
- Takes audio and reference transcriptions in bulk and generates WER☆13Updated 3 years ago
- 🐍 Coqui's machine learning job scheduler☆32Updated 3 years ago
- A simple audio file transcriber that uses the Google Cloud Speech API for transcription.☆26Updated 5 years ago
- automatically align transcribed audio and generate a wav2letter training corpus☆35Updated last year
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆24Updated last year
- A collection of pre-built speech synthesis settings used to convey emotion☆11Updated 5 years ago
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆33Updated 2 years ago
- SEPIA server to support open-source speech recognition via WebSocket connection.☆120Updated 2 weeks ago
- Text frontend for ESPnet tts recipes☆31Updated 3 years ago
- Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text)☆63Updated 11 months ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.☆200Updated 3 months ago
- A Node.js server that accepts audio/video files and transcribes the content☆61Updated 3 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆84Updated 6 months ago
- Docker images for Coqui AI☆57Updated 3 years ago
- DeepSpeech based forced alignment tool☆235Updated 3 years ago
- A simple voice conversion tool☆15Updated 2 years ago
- 🐸TTS recipes for different datasets☆84Updated 2 years ago
- Displays text in sync with audio being played. Works with VTT files.☆41Updated 6 years ago
- Kaldi API for Android, Python and Node. Forked from vosk-api with minimal modifications.☆16Updated 4 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆141Updated 6 months ago
- Community framework for training tortoise☆38Updated 2 years ago