gong-io / gecko
Gecko - A Tool for Effective Annotation of Human Conversations
☆274, updated last year
Related projects
Alternatives and complementary repositories for gecko
- DeepSpeech based forced alignment tool (☆233, updated 3 years ago)
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at… (☆366, updated 2 weeks ago)
- Segment an audio file and obtain utterance alignments. (Python package) (☆321, updated 5 months ago)
- Diarization scoring tools. (☆217, updated last year)
- Speaker diarization python system based on binary key speaker modelling (☆61, updated 2 years ago)
- Evaluate your speech-to-text system with similarity measures such as word error rate (WER) (☆631, updated last week)
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p! (☆134, updated this week)
- Open tools and data for cloudless automatic speech recognition (☆443, updated 3 years ago)
- Variational Bayes HMM over x-vectors diarization (☆252, updated 9 months ago)
- g2p: English Grapheme To Phoneme Conversion (☆810, updated last year)
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages (☆464, updated 4 years ago)
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems (☆187, updated last year)
- Grapheme to phoneme conversion with deep learning. (☆358, updated 11 months ago)
- A tokenizer, text cleaner, and phonemizer for many human languages. (☆279, updated 4 months ago)
- Spot the conversation: speaker diarisation in the wild (☆123, updated 2 years ago)
- A list of publicly available audio data that anyone can download for ASR or other speech activities (☆200, updated 3 years ago)
- A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech. (☆236, updated last year)
- Speaker diarization scripts, based on AaltoASR (☆190, updated 5 years ago)
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation (☆510, updated last year)
- A tool for automatic phoneme transcription (☆156, updated last year)
- Python library for handling audio datasets. (☆131, updated last year)
- Automatically constructing corpus for automatic speech recognition from YouTube videos (☆153, updated 4 years ago)
- End-to-End Neural Diarization (☆371, updated 3 years ago)
- A fast and lightweight python-based CTC beam search decoder for speech recognition. (☆427, updated last year)
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram (☆222, updated 3 months ago)
- This is a github repository of the abandonware Sequitur G2P by Bisani & Ney (☆155, updated 4 months ago)
- Reproducible experimental protocols for multimedia (audio, video, text) database (☆83, updated 3 weeks ago)
- [deprecated] Pretrained models for pyannote-audio 1.x (☆71, updated 2 years ago)
- Speaker embedding (d-vector) trained with GE2E loss (☆273, updated 10 months ago)
- Large, modern dataset for speech recognition (☆644, updated 8 months ago)