shahruk10 / kaldi-tfliteLinks
Convert kaldi feature extraction and nnet3 models into Tensorflow Lite models. Currently aimed at converting kaldi's x-vector models and diarization pipelines to tensorflow models.
☆20Updated 2 years ago
Alternatives and similar repositories for kaldi-tflite
Users that are interested in kaldi-tflite are comparing it to the libraries listed below
Sorting:
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆53Updated 2 months ago
- Clustering-based methods for overlapping diarization☆81Updated last year
- Went online decode demo☆30Updated 4 years ago
- An unofficial implementation of the Personal VAD speaker-conditioned voice activity detection method. Bachelor's thesis project.☆70Updated 2 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆40Updated 2 years ago
- Python wrapper for kaldi's arpa2fst☆38Updated 7 months ago
- A light-weight Python library for computing Kaldi-style acoustic features based on NumPy☆14Updated 4 years ago
- ☆25Updated 8 months ago
- C++ implementation of End to End TTS which combines both Tacatron2 and LPCNET Vocoder.☆32Updated 5 years ago
- Implementation of audio degradation processes☆103Updated 9 years ago
- Text frontend for ESPnet tts recipes☆34Updated 4 years ago
- Pronunciation-assisted Subword Modeling☆29Updated 6 years ago
- Online streaming speaker change detection model in Pytorch☆40Updated 2 years ago
- python wrapper for kaldi's native I/O☆27Updated 6 months ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆40Updated 4 years ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"☆17Updated 3 years ago
- Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"☆23Updated 5 years ago
- Balanced Error Rate for Speaker Diarization☆32Updated 2 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆73Updated 4 years ago
- Yin pitch estimator in PyTorch☆114Updated 2 years ago
- The codebase for Data-driven general-purpose voice activity detection.☆94Updated last year
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated 2 years ago
- Evaluation and Benchmarking of Speech Super-resolution Methods☆151Updated 3 years ago
- ☆29Updated 3 years ago
- Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.☆70Updated 4 years ago
- Y-vector: Multiscale Waveform Encoder for Speaker Embedding☆24Updated last year
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆140Updated last month
- Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)☆49Updated last year
- An implement of GlowTTS model. Several modes are added: speaker embedding, prosody encoder(GST), and gradient reversal.☆53Updated 2 years ago
- streaming attention networks for end-to-end automatic speech recognition☆55Updated 5 years ago