E2E-SincNet: Toward fully end-to-end speech recognition
☆30Feb 1, 2020Updated 6 years ago
Alternatives and similar repositories for E2E-SincNet
Users that are interested in E2E-SincNet are comparing it to the libraries listed below
Sorting:
- ☆76Mar 18, 2022Updated 3 years ago
- HMM, CTC, RNN-Transducer, forward-backward algorithm☆20Sep 5, 2023Updated 2 years ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Mar 23, 2021Updated 4 years ago
- ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…☆38Feb 13, 2020Updated 6 years ago
- PyTorch CTC Decoder bindings☆14Nov 2, 2017Updated 8 years ago
- Implementation of Imputer: Sequence Modelling via Imputation and Dynamic Programming in PyTorch☆58May 3, 2020Updated 5 years ago
- code for paper "learning to fool the speaker recognition"☆10Jun 12, 2020Updated 5 years ago
- A pytorch wrapper for LF-MMI training and parallel training in Kaldi☆73Jun 8, 2022Updated 3 years ago
- Non-Autoregressive Predictive Coding☆51Nov 3, 2020Updated 5 years ago
- unsupervised ASR (mainly phone classifier) using EODM and GAN☆12Oct 22, 2020Updated 5 years ago
- A packaged convolutional voice activity detector for noisy environments.☆14Jun 15, 2019Updated 6 years ago
- ☆15Apr 20, 2018Updated 7 years ago
- Auto Segmentation Criterion (ASG) implemented in pytorch☆51Oct 1, 2021Updated 4 years ago
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.☆368Oct 12, 2021Updated 4 years ago
- PyTorch implementation of LF-MMI for End-to-end ASR☆220Jan 14, 2021Updated 5 years ago
- BUT Multilingual Bottleneck Features☆15Mar 22, 2019Updated 6 years ago
- ☆15Jun 17, 2019Updated 6 years ago
- End-to-end ASR/LM implementation with PyTorch☆594Aug 30, 2021Updated 4 years ago
- Download and preperation tool for free speech corpora.☆16Apr 28, 2019Updated 6 years ago
- A lightweight real-time captioning application for macOS, powered by whisper.cpp and DeepSeek-V3.☆24Oct 11, 2025Updated 4 months ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆68Jan 5, 2026Updated last month
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Mar 6, 2023Updated 2 years ago
- Memory efficient transducer loss computation☆69Jun 10, 2022Updated 3 years ago
- The repo contains our code of ``Semantic Mask for Transformer based End-to-End Speech Recognition"☆39Jun 9, 2020Updated 5 years ago
- experiments with RETURNN☆161Feb 7, 2026Updated 3 weeks ago
- CUDA-Warp RNN-Transducer☆216Feb 22, 2023Updated 3 years ago
- ☆18Feb 9, 2020Updated 6 years ago
- Deep learning based speech source separation using Pytorch☆319Nov 20, 2020Updated 5 years ago
- ☆48Jan 8, 2021Updated 5 years ago
- A system works on singing voice synthesis☆79Jan 11, 2023Updated 3 years ago
- Tensor2tensor experiment with SpecAugment☆46May 13, 2019Updated 6 years ago
- (Pytorch and Tensorflow) Implementation of Weighted Contrastive Loss (Deep Metric Learning by Online Soft Mining and Class-Aware Attentio…☆21Oct 21, 2019Updated 6 years ago
- Online (real-time) decoder to be used with DeepSpeech2 model☆25Feb 27, 2020Updated 6 years ago
- Espresso: A Fast End-to-End Neural Speech Recognition Toolkit☆940Sep 4, 2024Updated last year
- ☆52Oct 17, 2023Updated 2 years ago
- Recurrent Neural Aligner☆51Apr 14, 2020Updated 5 years ago
- A library for speech data augmentation in time-domain☆683Aug 30, 2021Updated 4 years ago
- Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric L…☆56Jul 6, 2023Updated 2 years ago
- Implementation of a speaker identification and a speaker verification system based on Gaussian Mixture Models (GMM) in combination with a…☆21Mar 1, 2018Updated 8 years ago