gong-io / gecko
Gecko - A Tool for Effective Annotation of Human Conversations
☆279 · Updated last year
Alternatives and similar repositories for gecko:
Users who are interested in gecko are comparing it to the libraries listed below.
- DeepSpeech-based forced alignment tool ☆235 · Updated 4 years ago
- Segment an audio file and obtain utterance alignments. (Python package) ☆325 · Updated 8 months ago
- Python library for handling audio datasets. ☆136 · Updated last year
- A list of publicly available audio data that anyone can download for ASR or other speech activities ☆203 · Updated 3 years ago
- Various speech datasets made available to the public ☆110 · Updated last month
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems ☆200 · Updated 2 weeks ago
- Automatically constructing corpus for automatic speech recognition from YouTube videos ☆153 · Updated 4 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x ☆72 · Updated 2 years ago
- Grapheme to phoneme conversion with deep learning. ☆372 · Updated last year
- A tokenizer, text cleaner, and phonemizer for many human languages. ☆299 · Updated 2 months ago
- Diarization scoring tools. ☆233 · Updated last year
- Reproducible experimental protocols for multimedia (audio, video, text) database ☆94 · Updated 2 weeks ago
- 🙊 software for creating speech recognition models. ☆156 · Updated 7 months ago
- Large, modern dataset for speech recognition ☆657 · Updated 11 months ago
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at… ☆392 · Updated 3 months ago
- Speaker embedding (d-vector) trained with GE2E loss ☆274 · Updated last year
- Speaker diarization scripts, based on AaltoASR ☆190 · Updated 6 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ☆145 · Updated 8 months ago
- Open tools and data for cloudless automatic speech recognition ☆447 · Updated 3 years ago
- A fast and lightweight Python-based CTC beam search decoder for speech recognition. ☆435 · Updated last year
- End-to-End Neural Diarization ☆390 · Updated 3 years ago
- A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to those of native speech. ☆241 · Updated 2 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆107 · Updated last year
- Variational Bayes HMM over x-vectors diarization ☆261 · Updated last year
- Scripts for training general-purpose large vocabulary German acoustic models for ASR with Kaldi. ☆173 · Updated last year
- Massively multilingual pronunciation mining ☆331 · Updated 2 months ago
- Speaker diarization Python system based on binary key speaker modelling ☆61 · Updated 3 years ago
- Python server for communicating with Kaldi from the browser using WebRTC ☆69 · Updated last year
- g2p: English Grapheme To Phoneme Conversion ☆835 · Updated 2 years ago
- PyTorch Implementation of FastSpeech 2: Fast and High-Quality End-to-End Text to Speech ☆230 · Updated 2 years ago