A python IO interface for data accessing in kaldi
☆39Mar 18, 2021Updated 5 years ago
Alternatives and similar repositories for kaldi-python-io
Users that are interested in kaldi-python-io are comparing it to the libraries listed below
Sorting:
- A KALDI/C++ implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆14Sep 4, 2019Updated 6 years ago
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.☆378Jun 16, 2023Updated 2 years ago
- A pure python module for reading and writing kaldi ark files☆268Mar 6, 2025Updated last year
- ☆48Jan 8, 2021Updated 5 years ago
- Tools for Speech Enhancement integrated with Kaldi☆428Jul 6, 2023Updated 2 years ago
- Speech separation with utterance-level PIT experiments☆106Jul 12, 2018Updated 7 years ago
- magicspeech competition recipe☆18Jun 29, 2020Updated 5 years ago
- This is now the official location of the Kaldi project.☆10Aug 22, 2019Updated 6 years ago
- Multiobjective Optimization Training of PLDA for Speaker Verification☆10Jun 14, 2018Updated 7 years ago
- ☆131Aug 9, 2018Updated 7 years ago
- ☆11Jun 15, 2022Updated 3 years ago
- Time-domain Audio Separation Network☆24Aug 3, 2018Updated 7 years ago
- Script for converting kaldi GMM/HMM models to HTK format☆11Jul 18, 2024Updated last year
- Speech enhancement system for the CHiME-5 dinner party scenario☆109Feb 6, 2025Updated last year
- Remove noise from sound clips by use of supervised training and an ideal ratio mask.☆14Apr 2, 2019Updated 6 years ago
- NIST SPH File reader (e.g. for TEDLIUM Corpus)☆26May 2, 2020Updated 5 years ago
- Speaker embedding(verification and recognition) using Tensorflow with Kaldi☆41Sep 18, 2017Updated 8 years ago
- ☆60Sep 26, 2020Updated 5 years ago
- A temporal module for PyTorch-ComplexTensor☆44Jun 28, 2024Updated last year
- Cython implementation of Moattar and Homayounpour's Voice Activity Detection (VAD) algorithm fast enough for real-time on an RPi 3.☆12Aug 18, 2018Updated 7 years ago
- SWIG bindings for Kaldi I/O, built with Conda☆15Dec 15, 2024Updated last year
- python codes to extract MFCC and FBANK speech features for Kaldi☆67Nov 28, 2018Updated 7 years ago
- PyTorch implementation of LF-MMI for End-to-end ASR☆221Jan 14, 2021Updated 5 years ago
- Correspondence and autoencoder neural network training for speech using Pylearn2.☆14Dec 9, 2015Updated 10 years ago
- 语音识别 语音前端处理 语音合成 语音转换等等语音技术的资料汇总☆23Nov 8, 2019Updated 6 years ago
- A Python wrapper for Kaldi☆1,030Nov 30, 2025Updated 3 months ago
- MTalk-Bench: Evaluating Speech-to-Speech Models in Multi-Turn Dialogues via Arena-style and Rubrics Protocols☆18Nov 19, 2025Updated 4 months ago
- Simple Kaldi recipe for forced alignment☆11Jul 16, 2023Updated 2 years ago
- CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…☆85Jun 17, 2025Updated 9 months ago
- This is the implementation of the manuscript "Learning General All-Neural Speech Enhancement based on Taylor's Approximation Theory", whi…☆14Nov 25, 2022Updated 3 years ago
- Wake-Up-Word Keyword Spotting implemented in Keras☆35Oct 1, 2017Updated 8 years ago
- solutions for https://www.kaggle.com/c/tensorflow-speech-recognition-challenge☆31Jan 28, 2018Updated 8 years ago
- Implementation of state of the art d-vector approach for speaker verification☆127Oct 1, 2017Updated 8 years ago
- A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)☆17Apr 2, 2018Updated 7 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- ☆16Mar 7, 2019Updated 7 years ago
- ☆38May 16, 2022Updated 3 years ago
- The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"☆45Jul 10, 2019Updated 6 years ago
- it's ASR decoder and make graph project☆33May 26, 2022Updated 3 years ago