jim-schwoebel / pausesLinks
π€ quick library to extract pause lengths from audio files.
β32Updated 6 years ago
Alternatives and similar repositories for pauses
Users that are interested in pauses are comparing it to the libraries listed below
Sorting:
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented textβ34Updated 5 years ago
- A model that predicts the punctuation of English, Italian, French and German texts.β83Updated 2 years ago
- Wrapper for pydub AudioSegment objectsβ96Updated 2 years ago
- [deprecated] Pretrained models for pyannote-audio 1.xβ71Updated 3 years ago
- πAn easy-to-use package to restore punctuation of the text.β119Updated 2 years ago
- π A forced aligner intended for synchronization of narrated textβ100Updated 4 months ago
- DeepSpeech based forced alignment toolβ239Updated 5 years ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratoryβ16Updated 6 years ago
- Gecko - A Tool for Effective Annotation of Human Conversationsβ298Updated last week
- A deep learning model is developed which can predict the native country on the basis of the spoken english accentβ51Updated 5 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) databaseβ107Updated this week
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.β111Updated 6 months ago
- OpenAI Whisper Prompt Examplesβ52Updated 2 years ago
- β56Updated 2 years ago
- A module for normalising text.β173Updated 4 years ago
- Compute useful transcriptions metrics (CER, WER, SER, ...)β27Updated 11 years ago
- A tool for automatic phoneme transcriptionβ159Updated 2 years ago
- A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech.β268Updated 3 years ago
- Experimental project to punctuate text using a embedding layer, single convolutional layer and output softmax layer written in Keras.β83Updated 5 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networksβ65Updated 5 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisperβ119Updated 2 years ago
- Unicode Standard tokenization routines and orthography profile segmentationβ38Updated 9 months ago
- β14Updated 2 years ago
- SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognitionβ¦β99Updated 3 years ago
- Advanced data structures for handling temporal segments with attached labels.β122Updated 2 months ago
- A python package for deep multilingual punctuation prediction.β151Updated last year
- Speaker diarization python system based on binary key speaker modellingβ60Updated 3 years ago
- Timething is a library for aligning text transcripts with their audio recordings.β126Updated last year
- An HTML interface for finetuning the sync map output from aeneasβ53Updated 3 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.β80Updated 2 years ago