Unsupervised word segmentation and clustering of speech
☆13Feb 17, 2017Updated 9 years ago
Alternatives and similar repositories for segmentalist
Users that are interested in segmentalist are comparing it to the libraries listed below
Sorting:
- ☆13Jan 13, 2022Updated 4 years ago
- Using embedding-based loss functions for phonetics/speech recognition.☆17Nov 24, 2014Updated 11 years ago
- Software for unsupervised word segmentation and language model learning using lattices☆45Aug 17, 2016Updated 9 years ago
- Pitman-Yor processes in python☆26Apr 18, 2014Updated 11 years ago
- Software to apply unsupervised word segmentation on lattices or text sequences using a nested hierarchical Pitman Yor language model☆17Nov 24, 2016Updated 9 years ago
- A repository for dictionaries to be used with the Prosodylab-Aligner☆17May 13, 2014Updated 11 years ago
- Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>☆19Jan 23, 2022Updated 4 years ago
- WaveNet implementation using tf.estimator☆21Jul 6, 2023Updated 2 years ago
- ☆22Mar 22, 2017Updated 8 years ago
- ☆24Sep 25, 2018Updated 7 years ago
- An Attention Based Open-Source End to End Speech Synthesis Framework, No CNN, No RNN, No MFCC!!!☆85Oct 1, 2020Updated 5 years ago
- ☆27Apr 21, 2017Updated 8 years ago
- Zero-Resource Speech Discovery, Search, and Evaluation Tools☆29Aug 6, 2015Updated 10 years ago
- Gaussian Mixture VAE Tacotron☆53Jul 6, 2023Updated 2 years ago
- An extended TSP (Time Stretched Pulse). CAPRICEP substantially replaces FVN. CAPRICEP enables interactive and real-time measurement of th…☆29Nov 2, 2023Updated 2 years ago
- ☆30May 3, 2023Updated 2 years ago
- Pumilio: A Web-Based Management System for Ecological Recordings☆13Oct 29, 2018Updated 7 years ago
- This repository☆30Nov 13, 2022Updated 3 years ago
- Tensorflow with KenLM integrated for beam search scoring☆34Jul 28, 2017Updated 8 years ago
- Code, source data, examples, and audio excerpts for Flow: Expressive Rhythm in the Rapping Voice☆10Feb 13, 2020Updated 6 years ago
- Data and code for grapheme-to-phoneme transducers in lots of languages☆147Apr 5, 2024Updated last year
- ☆33Nov 7, 2019Updated 6 years ago
- Handling audio files in Python☆39Feb 12, 2026Updated 2 weeks ago
- Palette-class IPA Unicode Input Method for Mac OS☆49Jan 2, 2021Updated 5 years ago
- ATC-Anno is an annotation tool for Air Traffic Control data that offers automatic semantic and concept annotation.☆12Nov 17, 2023Updated 2 years ago
- ☆13Updated this week
- Phonetic Analysis ToolKIT - PATKIT - Python package for analysing phonetic data☆11Updated this week
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 10 years ago
- a python library for different types of vocoders like LPC, MCEP, PSOLA, etc.☆36Feb 21, 2015Updated 11 years ago
- ☆14May 25, 2022Updated 3 years ago
- ☆10Jul 24, 2019Updated 6 years ago
- Next word prediction based on N-gram language model☆12Jan 11, 2015Updated 11 years ago
- Original VinVL visual backbone with simplified APIs to easily extract features, boxes, object detections, in a few lines of Python code.☆11Nov 27, 2022Updated 3 years ago
- Matplotlib Image labeller for classifying images☆11Jan 5, 2026Updated last month
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆19Nov 3, 2025Updated 3 months ago
- Tool for Evaluating Multilingual WS-353 and SimLex-999☆10Dec 15, 2016Updated 9 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- Github mirror of MediaWiki extension Wikispeech - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Develo…☆12Updated this week