marktnoonan / transcription
Live Transcription based on Speech Recognition API
☆35Updated last year
Related projects ⓘ
Alternatives and complementary repositories for transcription
- Web app for keyword spotting using TensorflowJS☆69Updated last year
- DeepSpeech based forced alignment tool☆233Updated 3 years ago
- The human speaks a language with an accent. A particular accent necessarily reflects a person's linguistic background. The model defines …☆54Updated 3 years ago
- Unofficial Keras implementation of Google AI VoiceFilter☆35Updated last year
- Simple Diarization model☆42Updated 11 months ago
- Gecko - A Tool for Effective Annotation of Human Conversations☆274Updated last year
- Python forced alignment☆72Updated 6 months ago
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level.☆25Updated last year
- ☆13Updated last year
- speaker diarization system using an LSTM☆49Updated last year
- Simple GUI application to help record audio dictated from given text prompts, for use with training speech recognition or speech synthesi…☆40Updated 3 years ago
- On-device voice activity detection (VAD) powered by deep learning☆173Updated 2 weeks ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆100Updated last year
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models☆65Updated 2 years ago
- How to create your own model for vosk☆64Updated 3 years ago
- ☆32Updated 9 months ago
- An editor for speech-to-text transcripts such as AWS Transcribe and Mozilla DeepSpeech☆85Updated 2 months ago
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court☆22Updated last year
- Hyperaudio Lite - a Super-lightweight Interactive Transcript Player☆127Updated 2 weeks ago
- ONNX Inference of Pyannote Segmentation☆65Updated last month
- A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket conne…☆216Updated 4 years ago
- A PyTorch demo of the paper Voice Separation with an Unknown Number of Multiple Speakers using gradio and Nvidia NEMO ASR model.☆33Updated 10 months ago
- A sample Android app using [whisper.cpp](https://github.com/ggerganov/whisper.cpp/) to do voice-to-text transcriptions.☆64Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆98Updated last year
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆58Updated 2 years ago
- 🐸TTS recipes for different datasets☆84Updated 2 years ago
- Diarization scoring tools.☆217Updated last year
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆222Updated 3 months ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.☆199Updated 3 months ago
- An even smaller speech recognizer / force aligner☆32Updated 2 months ago
- Mozilla Voice Community Playbook☆43Updated 5 months ago