speechmatics / speechmatics-pythonLinks
Python library and CLI for Speechmatics
☆75Updated 5 months ago
Alternatives and similar repositories for speechmatics-python
Users that are interested in speechmatics-python are comparing it to the libraries listed below
Sorting:
- ☆357Updated last year
- Various speech datasets made available to the public☆130Updated last year
- On-device voice activity detection (VAD) powered by deep learning☆243Updated 2 weeks ago
- ☆323Updated last year
- Zero-shot Audio Classification using Whisper☆79Updated 3 years ago
- Advanced data structures for handling temporal segments with attached labels.☆124Updated 4 months ago
- A python package for whisper normalizer☆74Updated 4 months ago
- 🐸STT integration examples☆130Updated 3 years ago
- A merged version of multiple open-source German speech datasets.☆34Updated last year
- Reproducible experimental protocols for multimedia (audio, video, text) database☆112Updated 2 months ago
- ☆172Updated last week
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆150Updated 2 years ago
- A python package for deep multilingual punctuation prediction.☆156Updated last year
- A tokenizer, text cleaner, and phonemizer for many human languages.☆332Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆154Updated last year
- Batch Support for OpenAI Whisper☆97Updated 2 years ago
- Tunable pipelines☆41Updated 4 months ago
- A model that predicts the punctuation of English, Italian, French and German texts.☆83Updated 2 years ago
- ☆24Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Updated 3 years ago
- ☆45Updated 3 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- The Gridspace-Stanford Harper Valley speech dataset. Created in support of CS224S.☆50Updated 4 years ago
- Gecko - A Tool for Effective Annotation of Human Conversations☆301Updated 2 months ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 3 years ago
- Finetune VITS and MMS using HuggingFace's tools☆189Updated last year
- 📝An easy-to-use package to restore punctuation of the text.☆119Updated 2 years ago
- ☆49Updated 3 years ago
- Segment an audio file and obtain utterance alignments. (Python package)☆345Updated last year
- An efficient OpenFST-based tool for calculating WER and aligning two transcript sequences.☆169Updated 3 weeks ago