speechmatics / speechmatics-python
Python library and CLI for Speechmatics
☆70Updated 3 weeks ago
Alternatives and similar repositories for speechmatics-python:
Users that are interested in speechmatics-python are comparing it to the libraries listed below
- Javascript and Typescript SDK for Speechmatics☆47Updated this week
- Various speech datasets made available to the public☆115Updated 3 months ago
- ☆84Updated this week
- Reproducible experimental protocols for multimedia (audio, video, text) database☆98Updated last month
- Speaker Diarization with Transformers☆64Updated 10 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆81Updated last year
- ☆53Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆147Updated 11 months ago
- A model that predicts the punctuation of English, Italian, French and German texts.☆80Updated 2 years ago
- A merged version of multiple open-source German speech datasets.☆31Updated 11 months ago
- Create an LJSpeech structured voice dataset on wave input☆27Updated 6 months ago
- Speaker diarization model☆25Updated 2 years ago
- OpenAI Whisper Prompt Examples☆52Updated last year
- ☆38Updated 3 years ago
- Zero-shot Audio Classification using Whisper☆80Updated 2 years ago
- An automatic speech recognition API☆55Updated 2 weeks ago
- Advanced data structures for handling temporal segments with attached labels.☆111Updated last month
- On-device voice activity detection (VAD) powered by deep learning☆204Updated this week
- A python library for voice activity detection (VAD) for speech/non-speech segmentation.☆87Updated 2 years ago
- Simple Diarization model☆47Updated last year
- Evaluate results from ASR/Speech-to-Text quickly☆36Updated 3 years ago
- Timething is a library for aligning text transcripts with their audio recordings.☆117Updated 4 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆111Updated 2 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆135Updated last year
- Open Source AI Benchmarking toolkit for benchmarking speech to text services☆55Updated 11 months ago
- On-device speaker diarization powered by deep learning☆39Updated 2 weeks ago
- Tunable pipelines☆32Updated last month
- ☆280Updated 9 months ago
- ☆10Updated this week
- A python package for whisper normalizer☆53Updated last month