Using embedding-based loss functions for phonetics/speech recognition.
☆17Nov 24, 2014Updated 11 years ago
Alternatives and similar repositories for speech_embeddings
Users that are interested in speech_embeddings are comparing it to the libraries listed below
Sorting:
- Find how to pronounce words by breaking them up into their phones.☆24Jul 7, 2017Updated 8 years ago
- Code for NAACL 2018 paper "Parsing Speech: A Neural Approach to Integrating Lexical and Acoustic-Prosodic Information"☆13May 6, 2017Updated 8 years ago
- Convolutional REpresenations for Music Analysis☆12Jul 5, 2016Updated 9 years ago
- Unsupervised word segmentation and clustering of speech☆13Feb 17, 2017Updated 9 years ago
- ☆13Jan 13, 2022Updated 4 years ago
- ☆33Nov 7, 2019Updated 6 years ago
- Music structure segmentation with convnets☆13Mar 11, 2016Updated 9 years ago
- Mel-Generalized Cepstrum analysis☆20Jul 21, 2017Updated 8 years ago
- Deep Learning Tutorial in Python with Keras library☆21Feb 21, 2017Updated 9 years ago
- A repository for dictionaries to be used with the Prosodylab-Aligner☆17May 13, 2014Updated 11 years ago
- WaveNet implementation using tf.estimator☆21Jul 6, 2023Updated 2 years ago
- A simple toolkit for speaker segmentation and identification☆31Jun 15, 2013Updated 12 years ago
- Deep understanding and modelling of the hierarchical structure of prosody☆24May 12, 2019Updated 6 years ago
- singing voice analysis and detection tools☆21Jun 10, 2015Updated 10 years ago
- A python implementation of Speech intelligibility in bits (SIIB)☆25Apr 4, 2022Updated 3 years ago
- An extended TSP (Time Stretched Pulse). CAPRICEP substantially replaces FVN. CAPRICEP enables interactive and real-time measurement of th…☆29Nov 2, 2023Updated 2 years ago
- numeric fused-head identification and resolution☆33Oct 16, 2019Updated 6 years ago
- ☆30May 3, 2023Updated 2 years ago
- Trained data models for madmom: https://github.com/CPJKU/madmom☆25Mar 22, 2022Updated 3 years ago
- Hadoop MapReduce training of modified Kneser-Ney smoothed language models☆29Jun 12, 2018Updated 7 years ago
- Room Acoustics Impulse Response Generator using the Randomized Image Method (RIM)☆26May 29, 2022Updated 3 years ago
- A Praat plug-in for performing interactive phonetic forced alignment☆29Sep 22, 2018Updated 7 years ago
- Phone-level evaluation of L2 speakers (GOP algorithm)☆27Mar 1, 2017Updated 9 years ago
- Pumilio: A Web-Based Management System for Ecological Recordings☆13Oct 29, 2018Updated 7 years ago
- Detect calls of attention in the surroundings☆52Jun 10, 2013Updated 12 years ago
- La-O-Dan iOS study☆19Oct 11, 2011Updated 14 years ago
- Code, source data, examples, and audio excerpts for Flow: Expressive Rhythm in the Rapping Voice☆10Feb 13, 2020Updated 6 years ago
- Audio library for modelling loudness☆40Aug 8, 2019Updated 6 years ago
- This is not the official kaldi repository. It is better to fork https://github.com/kaldi-asr/kaldi or https://github.com/vimalmanohar/kal…☆33Aug 6, 2015Updated 10 years ago
- [Deprecated] Statistical Voice Conversion in Julia. See the website link for new library☆38Apr 15, 2017Updated 8 years ago
- Handling audio files in Python☆39Feb 12, 2026Updated 3 weeks ago
- Listen to the weather using Sonic Pi and data from Mathematica☆11Dec 6, 2018Updated 7 years ago
- a python library for different types of vocoders like LPC, MCEP, PSOLA, etc.☆36Feb 21, 2015Updated 11 years ago
- ATC-Anno is an annotation tool for Air Traffic Control data that offers automatic semantic and concept annotation.☆12Nov 17, 2023Updated 2 years ago
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 10 years ago
- Torch implementation of Whisper-guided DDPM based Voice Conversion☆49Mar 7, 2023Updated 2 years ago
- PyGun: Procedural Generation of Anechoic Gunshot Sounds☆14Oct 8, 2016Updated 9 years ago
- Palette-class IPA Unicode Input Method for Mac OS☆49Jan 2, 2021Updated 5 years ago
- Matplotlib Image labeller for classifying images☆11Jan 5, 2026Updated 2 months ago