shahruk10 / kaldi-tfliteView external linksLinks
Convert kaldi feature extraction and nnet3 models into Tensorflow Lite models. Currently aimed at converting kaldi's x-vector models and diarization pipelines to tensorflow models.
☆20Oct 6, 2022Updated 3 years ago
Alternatives and similar repositories for kaldi-tflite
Users that are interested in kaldi-tflite are comparing it to the libraries listed below
Sorting:
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…☆21Jan 24, 2022Updated 4 years ago
- ☆21Sep 24, 2018Updated 7 years ago
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…☆11Jun 28, 2021Updated 4 years ago
- ☆13Apr 14, 2024Updated last year
- Keyword Spotting suitable for embedded devices.☆28Jun 22, 2020Updated 5 years ago
- A collection of utilities for handling IPA phones.☆26Sep 24, 2023Updated 2 years ago
- Perform the forced decoding with target transcription☆11Sep 12, 2018Updated 7 years ago
- Using OpenVINO to speed up MeloTTS inference☆15Nov 1, 2024Updated last year
- Speech recognition module for Python, supporting several engines and APIs, online and offline.☆13Mar 9, 2022Updated 3 years ago
- Java Bindings for the C++ library DeepSpeech☆10Jun 4, 2020Updated 5 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- ☆17Apr 28, 2021Updated 4 years ago
- Pure C# port of the Pocketsphinx keyword spotter☆13Jan 19, 2020Updated 6 years ago
- 将normalize过的中文文本,做逆向normalize。具体功能即实现 chinese_text_normalization的逆向版本。☆13Apr 7, 2021Updated 4 years ago
- ☆15Feb 6, 2026Updated last week
- ☆13Oct 27, 2021Updated 4 years ago
- An implementation of the Wav2Letter Speech-to-Text model using PyTorch.☆14Mar 8, 2023Updated 2 years ago
- Launch your speech synthesis within one minute.☆12May 6, 2024Updated last year
- ☆16Sep 12, 2019Updated 6 years ago
- ☆13Mar 25, 2021Updated 4 years ago
- Test Framework for few-shot open set KWS☆41Nov 8, 2024Updated last year
- Assistance component base for Dicio assistant components☆13May 27, 2024Updated last year
- ☆12Jun 10, 2021Updated 4 years ago
- A light-weight Python library for computing Kaldi-style acoustic features based on NumPy☆14Aug 17, 2020Updated 5 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Jul 22, 2021Updated 4 years ago
- ☆33Aug 6, 2021Updated 4 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 3 years ago
- Tools for working with the CMU Pronunciation Dictionary☆36Sep 5, 2017Updated 8 years ago
- Lite Voice Terminal, an "offline smart speaker" solution powered by on-premise ASR server (vosk API / kaldi engine)☆18Feb 29, 2024Updated last year
- A fork of Idiap Research Institute's DiarTk diarization toolkit☆16Feb 20, 2016Updated 9 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Mar 6, 2023Updated 2 years ago
- Tensorflow training scripts for depthwise separable convolutional neural networks for keyword spotting, and C++ code for deployment.☆39Apr 2, 2020Updated 5 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆18Nov 30, 2022Updated 3 years ago
- A neural language modeling toolkit built on PyTorch☆19Mar 17, 2023Updated 2 years ago
- ☆17Apr 14, 2023Updated 2 years ago
- mnn tts demo.☆19May 7, 2025Updated 9 months ago
- Empower Your Voice, Secure Your Privacy - Experience VoiceAssistant, Your Customizable Offline Voice Assistant!☆18Jul 15, 2025Updated 7 months ago
- A corpus of speech from the Joe Rogan Experience podcast, consisting of 8.43 million words. It includes aligned TextGrids with phonetic a…☆21Jan 26, 2020Updated 6 years ago
- [NeurIPS 2023 - ML for Audio Workshop (Oral)] Zero-shot audio captioning with audio-language model guidance and audio context keywords☆18Nov 30, 2024Updated last year