speechmatics / speechmatics-pythonLinks
Python library and CLI for Speechmatics
☆74Updated 4 months ago
Alternatives and similar repositories for speechmatics-python
Users that are interested in speechmatics-python are comparing it to the libraries listed below
Sorting:
- ☆358Updated last year
- Gecko - A Tool for Effective Annotation of Human Conversations☆298Updated 3 weeks ago
- 🐸STT integration examples☆129Updated 3 years ago
- Various speech datasets made available to the public☆130Updated last year
- Advanced data structures for handling temporal segments with attached labels.☆124Updated 3 months ago
- DeepSpeech based forced alignment tool☆239Updated 5 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages.☆329Updated last year
- A model that predicts the punctuation of English, Italian, French and German texts.☆83Updated 2 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆153Updated last year
- Evaluate results from ASR/Speech-to-Text quickly☆40Updated 3 years ago
- ☆319Updated last year
- Finetune VITS and MMS using HuggingFace's tools☆182Updated last year
- Reproducible experimental protocols for multimedia (audio, video, text) database☆108Updated 2 weeks ago
- A python package for deep multilingual punctuation prediction.☆152Updated last year
- A python package for whisper normalizer☆70Updated 2 months ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆218Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- Zero-shot Audio Classification using Whisper☆79Updated 3 years ago
- An efficient OpenFST-based tool for calculating WER and aligning two transcript sequences.☆170Updated 7 months ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆34Updated 5 years ago
- The Gridspace-Stanford Harper Valley speech dataset. Created in support of CS224S.☆49Updated 4 years ago
- ☆156Updated 2 weeks ago
- Open Source AI Benchmarking toolkit for benchmarking speech to text services☆58Updated last year
- Command line tool to create corpora for Common Voice☆78Updated 3 weeks ago
- On-device voice activity detection (VAD) powered by deep learning☆238Updated last week
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Updated 2 years ago
- Batch Support for OpenAI Whisper☆95Updated last year
- Program to benchmark various speech recognition APIs☆81Updated 6 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 3 years ago
- An automatic speech recognition API☆76Updated last month