jim-schwoebel / pauses

🎤 quick library to extract pause lengths from audio files.

☆31

Alternatives and similar repositories for pauses

Users who are interested in pauses compare it to the libraries listed below.

pyannote / DEPRECATED-pyannote-audio-hub
[deprecated] Pretrained models for pyannote-audio 1.x
☆72 · Updated 2 years ago
tango4j / Auto-Tuning-Spectral-Clustering
This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"
☆119 · Updated 3 years ago
pyannote / pyannote-database
Reproducible experimental protocols for multimedia (audio, video, text) database
☆100 · Updated 3 months ago
chrisspen / punctuator2
A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text
☆36 · Updated 4 years ago
oliverguhr / fullstop-deep-punctuation-prediction
A model that predicts the punctuation of English, Italian, French and German texts.
☆80 · Updated 2 years ago
pyannote / pyannote-core
Advanced data structures for handling temporal segments with attached labels.
☆113 · Updated 3 months ago
wq2012 / SimpleDER
A lightweight library to compute Diarization Error Rate (DER).
☆59 · Updated last year
josepatino / pyBK
Speaker diarization python system based on binary key speaker modelling
☆61 · Updated 3 years ago
SuperKogito / pydiogment
Python library for audio augmentation
☆84 · Updated last year
jumon / whisper-punctuator
Zero-shot multimodal punctuation insertion and truecasing using Whisper
☆114 · Updated 2 years ago
philipperemy / speaker-change-detection
Paper: https://arxiv.org/abs/1702.02285
☆64 · Updated 6 years ago
akshanshchaudhry / Speech-Accent-Recognition
A deep learning model that predicts a speaker's native country from their spoken English accent
☆48 · Updated 5 years ago
oliverguhr / deepmultilingualpunctuation
A python package for deep multilingual punctuation prediction.
☆123 · Updated 9 months ago
r4victor / afaligner
📈 A forced aligner intended for synchronization of narrated text
☆93 · Updated 2 years ago
py-lidbox / lidbox
End-to-end spoken language identification out of the box.
☆48 · Updated 4 years ago
mohamad-hasan-sohan-ajini / G2P
Grapheme To Phoneme
☆73 · Updated 10 months ago
pyannote / pyannote-metrics
A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems
☆210 · Updated 3 months ago
cldf / segments
Unicode Standard tokenization routines and orthography profile segmentation
☆37 · Updated 3 months ago
jdvala / zoom_audio_transcribe
Zoom Audio Transcription offline
☆32 · Updated 4 years ago
yinruiqing / change_detection
Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks
☆65 · Updated 4 years ago
bootphon / shennong
A Python toolbox for speech features extraction
☆163 · Updated 2 years ago
jpuigcerver / xer
Compute useful transcription metrics (CER, WER, SER, ...)
☆27 · Updated 10 years ago
MaxStrange / AudioSegment
Wrapper for pydub AudioSegment objects
☆96 · Updated 2 years ago
ynop / audiomate
Python library for handling audio datasets.
☆138 · Updated last year
RuABraun / texterrors
☆36 · Updated last month
Appen / UHV-OTS-Speech
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
☆102 · Updated 2 years ago
Edresson / Wav2Vec-Wrapper
An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.
☆82 · Updated 2 years ago
cvqluu / simple_diarizer
Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
☆149 · Updated last year
alphacep / whisper-prompts
OpenAI Whisper Prompt Examples
☆52 · Updated last year
coqui-ai / TTS-recipes
🐸TTS recipes for different datasets
☆87 · Updated 2 years ago