shahruk10 / kaldi-tfliteLinks
Convert kaldi feature extraction and nnet3 models into Tensorflow Lite models. Currently aimed at converting kaldi's x-vector models and diarization pipelines to tensorflow models.
☆20Updated 3 years ago
Alternatives and similar repositories for kaldi-tflite
Users that are interested in kaldi-tflite are comparing it to the libraries listed below
Sorting:
- An online speech recognition extension toolkit of Kaldi☆56Updated 4 years ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆54Updated 3 months ago
- The codebase for Data-driven general-purpose voice activity detection.☆94Updated 2 years ago
- Went online decode demo☆31Updated 4 years ago
- A pitch tracker inspired by David Talkin's RAPT (Robust Algorithm for Pitch Tracking) written in Python.☆48Updated 9 years ago
- Y-vector: Multiscale Waveform Encoder for Speaker Embedding☆23Updated last year
- Python wrapper for kaldi's arpa2fst☆38Updated 3 months ago
- An unofficial implementation of the Personal VAD speaker-conditioned voice activity detection method. Bachelor's thesis project.☆77Updated 3 years ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"☆17Updated 4 years ago
- A Convolutional Neural Network based Voice Activity Detector for Smartphones☆70Updated 6 years ago
- Clustering-based methods for overlapping diarization☆81Updated last year
- Implementation of audio degradation processes☆105Updated 10 years ago
- Yin pitch estimator in PyTorch☆117Updated 3 years ago
- python wrapper for kaldi's native I/O☆28Updated 11 months ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆41Updated 4 years ago
- An implement of GlowTTS model. Several modes are added: speaker embedding, prosody encoder(GST), and gradient reversal.☆53Updated 3 years ago
- simple dnn based vad☆70Updated 7 years ago
- ☆14Updated 3 years ago
- C++ implementation of End to End TTS which combines both Tacatron2 and LPCNET Vocoder.☆32Updated 6 years ago
- A light-weight Python library for computing Kaldi-style acoustic features based on NumPy☆14Updated 5 years ago
- Multi-Task Audio Source Separation, Two-Stage Model, Complex Domain.☆94Updated 2 years ago
- Code and data recipes for the paper: Heterogeneous Target Speech Separation☆43Updated 3 years ago
- A pytorch wrapper for LF-MMI training and parallel training in Kaldi☆73Updated 3 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆74Updated 5 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆61Updated 2 years ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆148Updated 6 months ago
- ☆27Updated 3 years ago
- A simple package for Guided source separation (GSS)☆131Updated last year
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆42Updated 3 years ago
- Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.☆109Updated 2 years ago