jim-schwoebel / pauses
π€ quick library to extract pause lengths from audio files.
β31Updated 5 years ago
Alternatives and similar repositories for pauses:
Users that are interested in pauses are comparing it to the libraries listed below
- [deprecated] Pretrained models for pyannote-audio 1.xβ72Updated 2 years ago
- Speaker diarization python system based on binary key speaker modellingβ61Updated 3 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networksβ64Updated 4 years ago
- Wrapper for pydub AudioSegment objectsβ96Updated 2 years ago
- End-to-end spoken language identification out of the box.β48Updated 4 years ago
- Gentle and praatio scripts for easy forced alignmentβ18Updated 2 years ago
- Advanced data structures for handling temporal segments with attached labels.β111Updated last month
- A model that predicts the punctuation of English, Italian, French and German texts.β80Updated 2 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) databaseβ98Updated last month
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratoryβ16Updated 6 years ago
- Python library for handling audio datasets.β137Updated last year
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented textβ36Updated 4 years ago
- This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"β117Updated 2 years ago
- A crash course for training speech recognition models using DeepSpeech.β24Updated 3 years ago
- A collection of basic python modules for spoken natural language processingβ56Updated 5 years ago
- Python library for audio augmentationβ83Updated last year
- Speaker diarization via transfer learningβ27Updated 6 years ago
- A Python toolbox for speech features extractionβ162Updated 2 years ago
- A lightweight library to compute Diarization Error Rate (DER).β59Updated last year
- automatically align transcribed audio and generate a wav2letter training corpusβ36Updated last year
- πΉ pyannote + π notebook = pyannotebookβ26Updated last year
- OpenAI Whisper Prompt Examplesβ52Updated last year
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systemsβ204Updated last month
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone dataβ96Updated last year
- Compute useful transcriptions metrics (CER, WER, SER, ...)β27Updated 10 years ago
- A toolkit for producing n-gram language models. The highlights are the implementation of Kneser-Ney growing and revised Kneser pruning meβ¦β40Updated 6 months ago
- Paper: https://arxiv.org/abs/1702.02285β63Updated 6 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisperβ111Updated 2 years ago
- Support tools for punctuation and boundary detection for ASR output.β57Updated 2 years ago
- An online speech recognition extension toolkit of Kaldiβ56Updated 3 years ago