jim-schwoebel / pauses
π€ quick library to extract pause lengths from audio files.
β31Updated 5 years ago
Alternatives and similar repositories for pauses:
Users that are interested in pauses are comparing it to the libraries listed below
- Python library for handling audio datasets.β136Updated last year
- Speaker diarization python system based on binary key speaker modellingβ61Updated 3 years ago
- π A forced aligner intended for synchronization of narrated textβ89Updated 2 years ago
- A deep learning model is developed which can predict the native country on the basis of the spoken english accentβ47Updated 4 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) databaseβ96Updated last week
- A collection of basic python modules for spoken natural language processingβ56Updated 5 years ago
- Wrapper for pydub AudioSegment objectsβ96Updated 2 years ago
- A model that predicts the punctuation of English, Italian, French and German texts.β78Updated 2 years ago
- A crash course for training speech recognition models using DeepSpeech.β24Updated 3 years ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented textβ36Updated 4 years ago
- [deprecated] Pretrained models for pyannote-audio 1.xβ72Updated 2 years ago
- Compute useful transcriptions metrics (CER, WER, SER, ...)β27Updated 10 years ago
- Speaker diarization via transfer learningβ27Updated 5 years ago
- End-to-end spoken language identification out of the box.β48Updated 4 years ago
- Fast and accurate natural language detection. Detector written in Python. Nito-ELD, ELD.β15Updated last year
- A module for normalising text.β173Updated 3 years ago
- Jupyter Notebooks for creating Speech datasetsβ46Updated 5 years ago
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterancesβ49Updated 5 months ago
- Text and Punctuation correction with Deep Learningβ128Updated 4 years ago
- A TensorFlow Implementation of Punctuation Restoration.β18Updated 4 years ago
- automatically align transcribed audio and generate a wav2letter training corpusβ36Updated last year
- Advanced data structures for handling temporal segments with attached labels.β108Updated last week
- This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"β116Updated 2 years ago
- Convert Arpabet to IPA. Arpabet is the set of phonemes used by the CMU Pronouncing Dictionary. IPA is the International Phonetic Alphabetβ¦β43Updated 4 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networksβ64Updated 4 years ago
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level.β25Updated 2 years ago
- Gentle and praatio scripts for easy forced alignmentβ18Updated 2 years ago
- A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.β27Updated 8 months ago
- This is a legacy repo. Dev occurs now on GitHub.β11Updated 3 years ago
- β20Updated 6 years ago