E2E-SincNet: Toward fully end-to-end speech recognition
☆30Feb 1, 2020Updated 6 years ago
Alternatives and similar repositories for E2E-SincNet
Users that are interested in E2E-SincNet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆76Mar 18, 2022Updated 4 years ago
- HMM, CTC, RNN-Transducer, forward-backward algorithm☆20Sep 5, 2023Updated 2 years ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Mar 23, 2021Updated 5 years ago
- ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…☆38Feb 13, 2020Updated 6 years ago
- ☆15Apr 20, 2018Updated 7 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- PyTorch CTC Decoder bindings☆14Nov 2, 2017Updated 8 years ago
- A lightweight real-time captioning application for macOS, powered by whisper.cpp and DeepSeek-V3.☆24Oct 11, 2025Updated 5 months ago
- Non-Autoregressive Predictive Coding☆51Nov 3, 2020Updated 5 years ago
- Implementation of Imputer: Sequence Modelling via Imputation and Dynamic Programming in PyTorch☆58May 3, 2020Updated 5 years ago
- BUT Multilingual Bottleneck Features☆15Mar 22, 2019Updated 7 years ago
- A pytorch wrapper for LF-MMI training and parallel training in Kaldi☆73Jun 8, 2022Updated 3 years ago
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.☆369Oct 12, 2021Updated 4 years ago
- unsupervised ASR (mainly phone classifier) using EODM and GAN☆12Oct 22, 2020Updated 5 years ago
- experiments with RETURNN☆161Feb 7, 2026Updated last month
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆15Jun 17, 2019Updated 6 years ago
- ☆52Oct 17, 2023Updated 2 years ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆68Jan 5, 2026Updated 2 months ago
- End-to-end ASR/LM implementation with PyTorch☆594Aug 30, 2021Updated 4 years ago
- Recurrent Neural Aligner☆51Apr 14, 2020Updated 5 years ago
- code for paper "learning to fool the speaker recognition"☆10Jun 12, 2020Updated 5 years ago
- Memory efficient transducer loss computation☆70Jun 10, 2022Updated 3 years ago
- PyTorch implementation of LF-MMI for End-to-end ASR☆221Jan 14, 2021Updated 5 years ago
- A packaged convolutional voice activity detector for noisy environments.☆14Jun 15, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆24Jun 17, 2020Updated 5 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- ☆61Jan 31, 2023Updated 3 years ago
- CUDA-Warp RNN-Transducer☆216Feb 22, 2023Updated 3 years ago
- Tensor2tensor experiment with SpecAugment☆46May 13, 2019Updated 6 years ago
- Auto Segmentation Criterion (ASG) implemented in pytorch☆51Oct 1, 2021Updated 4 years ago
- Espresso: A Fast End-to-End Neural Speech Recognition Toolkit☆940Sep 4, 2024Updated last year
- A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.☆31Jun 17, 2024Updated last year
- Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric L…☆56Jul 6, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆18Feb 9, 2020Updated 6 years ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆80Sep 27, 2023Updated 2 years ago
- SincNet is a neural architecture for efficiently processing raw audio samples.☆1,236Apr 28, 2021Updated 4 years ago
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)☆104Nov 26, 2022Updated 3 years ago
- A library for speech data augmentation in time-domain☆684Aug 30, 2021Updated 4 years ago
- ☆11Feb 19, 2021Updated 5 years ago
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Mar 15, 2023Updated 3 years ago