daanzu / py-webrtcvad-wheelsLinks
Python interface to the WebRTC Voice Activity Detector (VAD) [released with binary wheels!]
β34Updated last week
Alternatives and similar repositories for py-webrtcvad-wheels
Users that are interested in py-webrtcvad-wheels are comparing it to the libraries listed below
Sorting:
- πΈSTT integration examplesβ129Updated 3 years ago
- Advanced data structures for handling temporal segments with attached labels.β124Updated 3 months ago
- Model for recasing and repunctuating ASR transcriptsβ143Updated last year
- Reproducible experimental protocols for multimedia (audio, video, text) databaseβ108Updated 2 weeks ago
- A tokenizer, text cleaner, and phonemizer for many human languages.β329Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of codeβ153Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.β137Updated 2 years ago
- Grapheme to phoneme conversion with deep learning.β415Updated 2 years ago
- An automatic speech recognition APIβ76Updated last month
- Zero-shot multimodal punctuation insertion and truecasing using Whisperβ119Updated 2 years ago
- A lightweight library to compute Diarization Error Rate (DER).β62Updated 2 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environmentsβ103Updated 5 years ago
- SEPIA server to support open-source speech recognition via WebSocket connection.β134Updated last year
- Silence detection in audio stream using webrtcvadβ49Updated 2 years ago
- ipapy is a Python module to work with International Phonetic Alphabet (IPA) stringsβ90Updated last year
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpusβ181Updated last year
- β156Updated 2 weeks ago
- β44Updated last year
- TurnGPT: a Transformer-based Language Model for Predicting Turn-taking in Spoken Dialogβ60Updated last year
- Voice analysis software (Python port of VoiceSauce)β60Updated 6 years ago
- Zero-shot Audio Classification using Whisperβ79Updated 3 years ago
- A merged version of multiple open-source German speech datasets.β33Updated last year
- A tool for automatic phoneme transcriptionβ159Updated 2 years ago
- Gecko - A Tool for Effective Annotation of Human Conversationsβ298Updated 3 weeks ago
- A python package for deep multilingual punctuation prediction.β152Updated last year
- A crash course for training speech recognition models using DeepSpeech.β24Updated 4 years ago
- Code for ICASSP 2024 paper WhisperSeg: Positive Transfer of the Whisper Speech Transformer to Human and Animal Voice Activity Detectionβ38Updated 5 months ago
- A model that predicts the punctuation of English, Italian, French and German texts.β83Updated 2 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stackβ26Updated 3 years ago
- Segment an audio file and obtain utterance alignments. (Python package)β343Updated last year