ABX and kaldi experiments on speech corpora made easy
☆33Oct 7, 2024Updated last year
Alternatives and similar repositories for abkhazia
Users that are interested in abkhazia are comparing it to the libraries listed below
Sorting:
- ABX discrimination task in python☆45Oct 7, 2024Updated last year
- A Python toolbox for speech features extraction☆165Feb 8, 2023Updated 3 years ago
- Tutorial on {Deep} Phonetic Tools given in BigPhon @ LabPhon15☆12Apr 17, 2017Updated 8 years ago
- Expected edit distance implementation using OpenFst tools☆11May 13, 2015Updated 10 years ago
- This is the home directory to speaker diarization module being developed for Hetergeneous News data in RedHen Labs as a GSOC Project☆10Sep 11, 2015Updated 10 years ago
- Cython implementation of Moattar and Homayounpour's Voice Activity Detection (VAD) algorithm fast enough for real-time on an RPi 3.☆12Aug 18, 2018Updated 7 years ago
- audio cfeatures extraction tool from wav to h5features format☆19May 24, 2019Updated 6 years ago
- A baseline Automatic Speech Recognition system for Polish based on Kaldi.☆18Dec 21, 2021Updated 4 years ago
- A monster repo for random research, not organized in any particular way☆13Dec 27, 2016Updated 9 years ago
- Digital Speech Processing in PyTorch.☆15Aug 12, 2022Updated 3 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Mar 2, 2022Updated 4 years ago
- (semi) Grapheme-to-Phoneme (G2P) - seq2seq model using PyTorch for Korean☆23Dec 17, 2017Updated 8 years ago
- Python package for the Zero Speech Challenge 2020☆14Feb 5, 2021Updated 5 years ago
- Extended speech recognition neural network based on Kaldi for reproducible research☆15Aug 28, 2015Updated 10 years ago
- ☆28Oct 7, 2025Updated 4 months ago
- A fork of Idiap Research Institute's DiarTk diarization toolkit☆16Feb 20, 2016Updated 10 years ago
- SWIG bindings for Kaldi I/O, built with Conda☆15Dec 15, 2024Updated last year
- Automatic speech recognition using neural networks☆18Nov 21, 2020Updated 5 years ago
- INACTIVE - http://mzl.la/ghe-archive - Tools to create ARPA models from cmu pocketsphinx dictionaries for proper g2p generation☆21Mar 29, 2019Updated 6 years ago
- Bayesian spEEch Recognizer☆55Jan 11, 2021Updated 5 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- pronunciation LEXicons for Any Low-resource Language☆21Jul 14, 2020Updated 5 years ago
- Implementations of growing and pruning in neural networks☆22Jul 26, 2023Updated 2 years ago
- Multilingual grapheme-to-phoneme conversion☆20Feb 23, 2018Updated 8 years ago
- speech engine training projects☆29Apr 19, 2021Updated 4 years ago
- Deep understanding and modelling of the hierarchical structure of prosody☆24May 12, 2019Updated 6 years ago
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 10 years ago
- Tool for Evaluating Multilingual WS-353 and SimLex-999☆10Dec 15, 2016Updated 9 years ago
- An advance kaldi wrapper for Pyhton☆38Mar 1, 2021Updated 5 years ago
- Compute the most likely permutation of a lattice given an LM☆10Jan 3, 2013Updated 13 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- Portal Tutorial☆11Feb 3, 2018Updated 8 years ago
- Audio source separation using CASA approaches in Python.☆11Apr 2, 2015Updated 10 years ago
- Transform audio files into mel spectrograms for text-to-speech model training☆12Aug 25, 2021Updated 4 years ago
- Scripts for recreating the Replication Dataset for Fundamental Frequency Estimation. Part of the dissertation "Pitch of Voiced Speech in …☆11Mar 29, 2021Updated 4 years ago
- Source code for "Unsupervised Lexicon Discovery from Acoustic Input ", Lee et al, 2015 TACL☆10Aug 11, 2016Updated 9 years ago
- Grapheme to phoneme converter for Estonian☆14May 27, 2021Updated 4 years ago
- Hybrid GAN (HiFi-WaveGAN) applied to footsteps sound effects☆12Jul 17, 2023Updated 2 years ago
- ASR transcription and SLU annotation web interface for call logs collected at UFAL-DSG.☆11Dec 8, 2014Updated 11 years ago