kamperh / recipe_swbd_wordembedsView external linksLinks
☆22Mar 22, 2017Updated 8 years ago
Alternatives and similar repositories for recipe_swbd_wordembeds
Users that are interested in recipe_swbd_wordembeds are comparing it to the libraries listed below
Sorting:
- ☆45Apr 5, 2019Updated 6 years ago
- Siamese neural networks for representation learning using Theano.☆21Oct 14, 2015Updated 10 years ago
- ☆27Apr 21, 2017Updated 8 years ago
- A repository for dictionaries to be used with the Prosodylab-Aligner☆17May 13, 2014Updated 11 years ago
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 9 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- Code for NAACL 2018 paper "Parsing Speech: A Neural Approach to Integrating Lexical and Acoustic-Prosodic Information"☆13May 6, 2017Updated 8 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- A python implementation of Speech intelligibility in bits (SIIB)☆25Apr 4, 2022Updated 3 years ago
- Framework for Detection Evaluation (F4DE) : set of evaluation tools for detection evaluations and for specific NIST-coordinated evaluatio…☆25Jul 6, 2017Updated 8 years ago
- Unsupervised word segmentation and clustering of speech☆13Feb 17, 2017Updated 9 years ago
- Text normalization scripts from IRISA lab☆14Jun 1, 2018Updated 7 years ago
- Zero-Mean Convolutions for Level-Invariant Singing Voice Detection☆11Jun 15, 2018Updated 7 years ago
- Articulatory features estimation using Listen Attend and Spell architecture.☆33Apr 24, 2020Updated 5 years ago
- AudioVisual Diarization - Supervised and Unsupervised☆15Nov 22, 2022Updated 3 years ago
- ☆12Jun 10, 2021Updated 4 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Oct 8, 2023Updated 2 years ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Oct 26, 2020Updated 5 years ago
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- ☆16Dec 23, 2021Updated 4 years ago
- ☆15May 26, 2021Updated 4 years ago
- Code for SLT 2016 paper on Grapheme-to-Phoneme conversion using attention based encoder-decoder models☆15Feb 20, 2019Updated 6 years ago
- ChiNese Text Normalization (CNTN) tool for Text-to-speech system☆37Apr 12, 2018Updated 7 years ago
- Representations of language in a model of visually grounded speech signal.☆23Apr 19, 2018Updated 7 years ago
- Code for ICASSP 2019 paper☆18Oct 29, 2018Updated 7 years ago
- Feature extractor for DL speech processing.☆66Apr 13, 2022Updated 3 years ago
- Multipurpose Multi Speaker Mixture Signal Generator☆46Feb 6, 2025Updated last year
- maracas is a library for corrupting audio files with additive and convolutive noise.☆72Aug 22, 2017Updated 8 years ago
- MSR Identity Toolkit v1.0☆17Aug 18, 2017Updated 8 years ago
- ☆70Nov 30, 2020Updated 5 years ago
- Mel-Generalized Cepstrum analysis☆20Jul 21, 2017Updated 8 years ago
- generative models for speech☆20Jul 4, 2016Updated 9 years ago
- Download and create a tfreader for the audioset dataset☆16Apr 16, 2020Updated 5 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- ☆20Nov 22, 2020Updated 5 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Oct 8, 2021Updated 4 years ago
- ☆106Mar 12, 2021Updated 4 years ago