jim-schwoebel / pauses
🎤 quick library to extract pause lengths from audio files.
☆32Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for pauses
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks☆62Updated 4 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆71Updated 2 years ago
- End-to-end spoken language identification out of the box.☆48Updated 3 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆84Updated last month
- Advanced data structures for handling temporal segments with attached labels.☆99Updated 5 months ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory☆16Updated 5 years ago
- Gentle and praatio scripts for easy forced alignment☆18Updated 2 years ago
- A crash course for training speech recognition models using DeepSpeech.☆24Updated 3 years ago
- Speaker diarization python system based on binary key speaker modelling☆61Updated 2 years ago
- Tunable pipelines☆30Updated last month
- Wrapper for pydub AudioSegment objects☆95Updated last year
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level.☆25Updated last year
- A model that predicts the punctuation of English, Italian, French and German texts.☆74Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆99Updated last year
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆36Updated 4 years ago
- SylNet: An Adaptable End-to-End Syllable Count Estimator for Speech☆22Updated last year
- Code for AccentDB.☆19Updated 3 years ago
- 🐸TTS recipes for different datasets☆84Updated 2 years ago
- JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voice☆10Updated 4 years ago
- A TensorFlow Implementation of Punctuation Restoration.☆18Updated 4 years ago
- Speaker diarization service☆19Updated this week
- This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"☆111Updated 2 years ago
- Feature extractor for DL speech processing.☆65Updated 2 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆100Updated last year
- It is an algorithm analysed the acoustic features of a voice and creates an acoustic classifier - USEFUL for auto-speech-rater☆11Updated 5 years ago
- 🎹 pyannote + 🗒 notebook = pyannotebook☆25Updated last year
- Python library for handling audio datasets.☆131Updated last year
- DeepSpeech based forced alignment tool☆235Updated 3 years ago
- Keras(Tensorflow) implementations of Automatic Speech Recognition☆22Updated 2 years ago
- A python package for deep multilingual punctuation prediction.☆98Updated 3 months ago